Saved in:
| Main Authors: | Zhou, Yunqi, Jiang, Chengjie, Yuan, Chun, Li, Jing |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.20460 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming
by: Zhou, Yue, et al.
Published: (2026)
by: Zhou, Yue, et al.
Published: (2026)
Adaptive Chain-of-Focus Reasoning via Dynamic Visual Search and Zooming for Efficient VLMs
by: Zhang, Xintong, et al.
Published: (2025)
by: Zhang, Xintong, et al.
Published: (2025)
Inject Where It Matters: Training-Free Spatially-Adaptive Identity Preservation for Text-to-Image Personalization
by: Li, Guandong, et al.
Published: (2026)
by: Li, Guandong, et al.
Published: (2026)
GRASP: Geospatial pixel Reasoning viA Structured Policy learning
by: Jiang, Chengjie, et al.
Published: (2025)
by: Jiang, Chengjie, et al.
Published: (2025)
AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement
by: Pei, Siqi, et al.
Published: (2026)
by: Pei, Siqi, et al.
Published: (2026)
AdaptOVCD: Training-Free Open-Vocabulary Remote Sensing Change Detection via Adaptive Information Fusion
by: Dou, Mingyu, et al.
Published: (2026)
by: Dou, Mingyu, et al.
Published: (2026)
Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models
by: Ren, Qin, et al.
Published: (2025)
by: Ren, Qin, et al.
Published: (2025)
Multilingual Training-Free Remote Sensing Image Captioning
by: Rebelo, Carlos, et al.
Published: (2025)
by: Rebelo, Carlos, et al.
Published: (2025)
MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery
by: Li, Yansheng, et al.
Published: (2025)
by: Li, Yansheng, et al.
Published: (2025)
Enabling Training-Free Text-Based Remote Sensing Segmentation
by: Sosa, Jose, et al.
Published: (2026)
by: Sosa, Jose, et al.
Published: (2026)
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
by: Li, Kaiyu, et al.
Published: (2024)
by: Li, Kaiyu, et al.
Published: (2024)
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition
by: Zheng, Yijie, et al.
Published: (2025)
by: Zheng, Yijie, et al.
Published: (2025)
RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification
by: Zou, Guangwenjie, et al.
Published: (2024)
by: Zou, Guangwenjie, et al.
Published: (2024)
Look, Zoom, Understand: The Robotic Eyeball for Embodied Perception
by: Yang, Jiashu, et al.
Published: (2025)
by: Yang, Jiashu, et al.
Published: (2025)
Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images
by: Ding, Lei, et al.
Published: (2023)
by: Ding, Lei, et al.
Published: (2023)
Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding
by: Li, Yueying, et al.
Published: (2026)
by: Li, Yueying, et al.
Published: (2026)
Revisiting Change VQA in Remote Sensing with Structured and Native Multimodal Qwen Models
by: Bazi, Yakoub, et al.
Published: (2026)
by: Bazi, Yakoub, et al.
Published: (2026)
Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images
by: Li, Kaiyu, et al.
Published: (2025)
by: Li, Kaiyu, et al.
Published: (2025)
GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery
by: Jiang, Lifan, et al.
Published: (2026)
by: Jiang, Lifan, et al.
Published: (2026)
Look-Ahead and Look-Back Flows: Training-Free Image Generation with Trajectory Smoothing
by: Luo, Yan, et al.
Published: (2026)
by: Luo, Yan, et al.
Published: (2026)
Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
by: Shabtay, Nimrod, et al.
Published: (2026)
by: Shabtay, Nimrod, et al.
Published: (2026)
Adaptive Image Zoom-in with Bounding Box Transformation for UAV Object Detection
by: Wang, Tao, et al.
Published: (2026)
by: Wang, Tao, et al.
Published: (2026)
LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning
by: Fu, Shenghao, et al.
Published: (2025)
by: Fu, Shenghao, et al.
Published: (2025)
Modularized Zero-shot VQA with Pre-trained Models
by: Cao, Rui, et al.
Published: (2023)
by: Cao, Rui, et al.
Published: (2023)
LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision
by: Fuller, Anthony, et al.
Published: (2025)
by: Fuller, Anthony, et al.
Published: (2025)
RS-Prune: Training-Free Data Pruning at High Ratios for Efficient Remote Sensing Diffusion Foundation Models
by: Wei, Fan, et al.
Published: (2025)
by: Wei, Fan, et al.
Published: (2025)
Remote Sensing Object Counting with Online Knowledge Learning
by: Jiang, Shengqin, et al.
Published: (2023)
by: Jiang, Shengqin, et al.
Published: (2023)
SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos
by: Gao, Mingqi, et al.
Published: (2025)
by: Gao, Mingqi, et al.
Published: (2025)
RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
by: Liu, Zhuoran, et al.
Published: (2024)
by: Liu, Zhuoran, et al.
Published: (2024)
Pay Attention to Where You Looked
by: Berian, Alex, et al.
Published: (2026)
by: Berian, Alex, et al.
Published: (2026)
Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
by: Li, Ke, et al.
Published: (2024)
by: Li, Ke, et al.
Published: (2024)
Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in
by: Shen, Xiaoqian, et al.
Published: (2025)
by: Shen, Xiaoqian, et al.
Published: (2025)
Where It Moves, It Matters: Referring Surgical Instrument Segmentation via Motion
by: Wei, Meng, et al.
Published: (2026)
by: Wei, Meng, et al.
Published: (2026)
Seeing Clearly without Training: Mitigating Hallucinations in Multimodal LLMs for Remote Sensing
by: Liu, Yi, et al.
Published: (2026)
by: Liu, Yi, et al.
Published: (2026)
ConInfer: Context-Aware Inference for Training-Free Open-Vocabulary Remote Sensing Segmentation
by: Chen, Wenyang, et al.
Published: (2026)
by: Chen, Wenyang, et al.
Published: (2026)
UltraZoom: Generating Gigapixel Images from Regular Photos
by: Ma, Jingwei, et al.
Published: (2025)
by: Ma, Jingwei, et al.
Published: (2025)
PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild
by: Yuan, Kun, et al.
Published: (2024)
by: Yuan, Kun, et al.
Published: (2024)
DGL-RSIS: Decoupling Global Spatial Context and Local Class Semantics for Training-Free Remote Sensing Image Segmentation
by: Li, Boyi, et al.
Published: (2025)
by: Li, Boyi, et al.
Published: (2025)
Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection
by: Du, Qiangang, et al.
Published: (2024)
by: Du, Qiangang, et al.
Published: (2024)
Ultra-Low Complexity On-Orbit Compression for Remote Sensing Imagery via Block Modulated Imaging
by: Wang, Zhibin, et al.
Published: (2024)
by: Wang, Zhibin, et al.
Published: (2024)
Similar Items
-
Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming
by: Zhou, Yue, et al.
Published: (2026) -
Adaptive Chain-of-Focus Reasoning via Dynamic Visual Search and Zooming for Efficient VLMs
by: Zhang, Xintong, et al.
Published: (2025) -
Inject Where It Matters: Training-Free Spatially-Adaptive Identity Preservation for Text-to-Image Personalization
by: Li, Guandong, et al.
Published: (2026) -
GRASP: Geospatial pixel Reasoning viA Structured Policy learning
by: Jiang, Chengjie, et al.
Published: (2025) -
AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement
by: Pei, Siqi, et al.
Published: (2026)