| Main Authors: | Xiao, Aoran, Cheng, Shihao, Xu, Yonghao, Ren, Yexian, Chen, Hongruixuan, Yokoya, Naoto |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08896 |
Similar Items
MM-OVSeg: Multimodal Optical-SAR Fusion for Open-Vocabulary Segmentation in Remote Sensing
by: Wei, Yimin, et al.
Published: (2026)
SARLANG-1M: A Benchmark for Vision-Language Modeling in SAR Image Understanding
by: Wei, Yimin, et al.
Published: (2025)
Change Detection Between Optical Remote Sensing Imagery and Map Data via Segment Anything Model (SAM)
by: Chen, Hongruixuan, et al.
Published: (2024)
ChangeMamba: Remote Sensing Change Detection With Spatiotemporal State Space Model
by: Chen, Hongruixuan, et al.
Published: (2024)
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery
by: Song, Jian, et al.
Published: (2024)
A Vision Centric Remote Sensing Benchmark
by: Adejumo, Abduljaleel, et al.
Published: (2025)
Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark
by: Broni-Bediako, Clifford, et al.
Published: (2024)
Foundation Models for Remote Sensing and Earth Observation: A Survey
by: Xiao, Aoran, et al.
Published: (2024)
Enhancing Monocular Height Estimation via Sparse LiDAR-Guided Correction
by: Song, Jian, et al.
Published: (2025)
VLM2GeoVec: Toward Universal Multimodal Embeddings for Remote Sensing
by: Aimar, Emanuel Sánchez, et al.
Published: (2025)
A Survey of Sample-Efficient Deep Learning for Change Detection in Remote Sensing: Tasks, Strategies, and Challenges
by: Ding, Lei, et al.
Published: (2025)
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing
by: Zhang, Zilun, et al.
Published: (2025)
Segment Anything with Multiple Modalities
by: Xiao, Aoran, et al.
Published: (2024)
DisasterM3: A Remote Sensing Vision-Language Dataset for Disaster Damage Assessment and Response
by: Wang, Junjue, et al.
Published: (2025)
Towards Realistic Remote Sensing Dataset Distillation with Discriminative Prototype-guided Diffusion
by: Xu, Yonghao, et al.
Published: (2026)
ObjFormer: Learning Land-Cover Changes From Paired OSM Data and Optical High-Resolution Imagery via Object-Guided Transformer
by: Chen, Hongruixuan, et al.
Published: (2023)
OpenEarthMap-SAR: A Benchmark Synthetic Aperture Radar Dataset for Global High-Resolution Land Cover Mapping
by: Xia, Junshi, et al.
Published: (2025)
Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery
by: Tsujimoto, Mai, et al.
Published: (2025)
CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation
by: Gong, Ziyang, et al.
Published: (2024)
Experience-Driven Multi-Agent Systems Are Training-free Context-aware Earth Observers
by: Dai, Pengyu, et al.
Published: (2026)
ChangeBridge: Spatiotemporal Image Generation with Multimodal Controls for Remote Sensing
by: Zhao, Zhenghui, et al.
Published: (2025)
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
by: Wu, Yuhang, et al.
Published: (2024)
GeoHeight-Bench: Towards Height-Aware Multimodal Reasoning in Remote Sensing
by: Hu, Xuran, et al.
Published: (2026)
On the Adversarial Vulnerabilities of Transfer Learning in Remote Sensing
by: Bai, Tao, et al.
Published: (2025)
Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM
by: Fang, Xinyu, et al.
Published: (2025)
GMAI-MMBench: A Comprehensive Multimodal Evaluation Benchmark Towards General Medical AI
by: Chen, Pengcheng, et al.
Published: (2024)
Building Extraction from Remote Sensing Imagery under Hazy and Low-light Conditions: Benchmark and Baseline
by: Sang, Feifei, et al.
Published: (2026)
Bridging Supervision Gaps: A Unified Framework for Remote Sensing Change Detection
by: Jiang, Kaixuan, et al.
Published: (2026)
Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models
by: Yu, Weikang, et al.
Published: (2023)
GeoR-Bench: Evaluating Geoscience Visual Reasoning
by: Zheng, Yushuo, et al.
Published: (2026)
HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images
by: Han, Chengxi, et al.
Published: (2024)
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing
by: Shabbir, Akashah, et al.
Published: (2025)
SpectralGPT: Spectral Remote Sensing Foundation Model
by: Hong, Danfeng, et al.
Published: (2023)
SARU: A Shadow-Aware and Removal Unified Framework for Remote Sensing Images with New Benchmarks
by: Bo, Zi-Yang, et al.
Published: (2026)
Direction-aware 3D Large Multimodal Models
by: Liu, Quan, et al.
Published: (2026)
Flooding Regularization for Stable Training of Generative Adversarial Networks
by: Yahiro, Iu, et al.
Published: (2023)
AdaptMMBench: Benchmarking Adaptive Multimodal Reasoning for Mode Selection and Reasoning Process
by: Zhang, Xintong, et al.
Published: (2026)
Change Guiding Network: Incorporating Change Prior to Guide Change Detection in Remote Sensing Imagery
by: Han, Chengxi, et al.
Published: (2024)
GeoMeld: Toward Semantically Grounded Foundation Models for Remote Sensing
by: Hasan, Maram, et al.
Published: (2026)
Unsupervised Domain Adaptation Architecture Search with Self-Training for Land Cover Mapping
by: Broni-Bediako, Clifford, et al.
Published: (2024)