Library Catalog

Bibliographic Details
Main Authors:	Zhou, Yunqi; Jiang, Chengjie; Yuan, Chun; Li, Jing
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.20460

Similar Items

Look-Closer-Then-Diagnose: Confidence-Aware Ultrasound VQA via Active Zooming
by: Zhou, Yue, et al.
Published: (2026)

Adaptive Chain-of-Focus Reasoning via Dynamic Visual Search and Zooming for Efficient VLMs
by: Zhang, Xintong, et al.
Published: (2025)

Inject Where It Matters: Training-Free Spatially-Adaptive Identity Preservation for Text-to-Image Personalization
by: Li, Guandong, et al.
Published: (2026)

GRASP: Geospatial pixel Reasoning viA Structured Policy learning
by: Jiang, Chengjie, et al.
Published: (2025)

AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement
by: Pei, Siqi, et al.
Published: (2026)

AdaptOVCD: Training-Free Open-Vocabulary Remote Sensing Change Detection via Adaptive Information Fusion
by: Dou, Mingyu, et al.
Published: (2026)

Scale Where It Matters: Training-Free Localized Scaling for Diffusion Models
by: Ren, Qin, et al.
Published: (2025)

Multilingual Training-Free Remote Sensing Image Captioning
by: Rebelo, Carlos, et al.
Published: (2025)

MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery
by: Li, Yansheng, et al.
Published: (2025)

Enabling Training-Free Text-Based Remote Sensing Segmentation
by: Sosa, Jose, et al.
Published: (2026)

SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images
by: Li, Kaiyu, et al.
Published: (2024)

InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition
by: Zheng, Yijie, et al.
Published: (2025)

RemoteTrimmer: Adaptive Structural Pruning for Remote Sensing Image Classification
by: Zou, Guangwenjie, et al.
Published: (2024)

Look, Zoom, Understand: The Robotic Eyeball for Embodied Perception
by: Yang, Jiashu, et al.
Published: (2025)

Adapting Segment Anything Model for Change Detection in HR Remote Sensing Images
by: Ding, Lei, et al.
Published: (2023)

Semantic-Geometric Dual Compression: Training-Free Visual Token Reduction for Ultra-High-Resolution Remote Sensing Understanding
by: Li, Yueying, et al.
Published: (2026)

Revisiting Change VQA in Remote Sensing with Structured and Native Multimodal Qwen Models
by: Bazi, Yakoub, et al.
Published: (2026)

Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images
by: Li, Kaiyu, et al.
Published: (2025)

GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery
by: Jiang, Lifan, et al.
Published: (2026)

Look-Ahead and Look-Back Flows: Training-Free Image Generation with Trajectory Smoothing
by: Luo, Yan, et al.
Published: (2026)

Look Where It Matters: High-Resolution Crops Retrieval for Efficient VLMs
by: Shabtay, Nimrod, et al.
Published: (2026)

Adaptive Image Zoom-in with Bounding Box Transformation for UAV Object Detection
by: Wang, Tao, et al.
Published: (2026)

LOVE-R1: Advancing Long Video Understanding with an Adaptive Zoom-in Mechanism via Multi-Step Reasoning
by: Fu, Shenghao, et al.
Published: (2025)

Modularized Zero-shot VQA with Pre-trained Models
by: Cao, Rui, et al.
Published: (2023)

LookWhere? Efficient Visual Recognition by Learning Where to Look and What to See from Self-Supervision
by: Fuller, Anthony, et al.
Published: (2025)

RS-Prune: Training-Free Data Pruning at High Ratios for Efficient Remote Sensing Diffusion Foundation Models
by: Wei, Fan, et al.
Published: (2025)

Remote Sensing Object Counting with Online Knowledge Learning
by: Jiang, Shengqin, et al.
Published: (2023)

SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos
by: Gao, Mingqi, et al.
Published: (2025)

RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents
by: Liu, Zhuoran, et al.
Published: (2024)

Pay Attention to Where You Looked
by: Berian, Alex, et al.
Published: (2026)

Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
by: Li, Ke, et al.
Published: (2024)

Zoom-Zero: Reinforced Coarse-to-Fine Video Understanding via Temporal Zoom-in
by: Shen, Xiaoqian, et al.
Published: (2025)

Where It Moves, It Matters: Referring Surgical Instrument Segmentation via Motion
by: Wei, Meng, et al.
Published: (2026)

Seeing Clearly without Training: Mitigating Hallucinations in Multimodal LLMs for Remote Sensing
by: Liu, Yi, et al.
Published: (2026)

ConInfer: Context-Aware Inference for Training-Free Open-Vocabulary Remote Sensing Segmentation
by: Chen, Wenyang, et al.
Published: (2026)

UltraZoom: Generating Gigapixel Images from Regular Photos
by: Ma, Jingwei, et al.
Published: (2025)

PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild
by: Yuan, Kun, et al.
Published: (2024)

DGL-RSIS: Decoupling Global Spatial Context and Local Class Semantics for Training-Free Remote Sensing Image Segmentation
by: Li, Boyi, et al.
Published: (2025)

Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection
by: Du, Qiangang, et al.
Published: (2024)

Ultra-Low Complexity On-Orbit Compression for Remote Sensing Imagery via Block Modulated Imaging
by: Wang, Zhibin, et al.
Published: (2024)