| Main Authors: | Ding, Hao, Yang, Zhichuan, Ge, Weijie, Gao, Ziqin, Lu, Chaoyi, Zhao, Lei |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.14482 |
Similar Items
Align then Adapt: Rethinking Parameter-Efficient Transfer Learning in 4D Perception
by: Sun, Yiding, et al.
Published: (2026)
Integrating Fine-Grained Audio-Visual Evidence for Robust Multimodal Emotion Reasoning
by: Zhao, Zhixian, et al.
Published: (2026)
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning
by: von Klinski, Maximilian, et al.
Published: (2026)
PointRFT: Explicit Reinforcement Fine-tuning for Point Cloud Few-shot Learning
by: Wang, Yankai, et al.
Published: (2026)
Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification
by: Gao, Pengxiang, et al.
Published: (2025)
Corrected with the Latest Version: Make Robust Asynchronous Federated Learning Possible
by: Lu, Chaoyi, et al.
Published: (2025)
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
by: Wu, Xueqing, et al.
Published: (2024)
Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning of Vision Language Models
by: Tan, Huajie, et al.
Published: (2025)
PROPA: Toward Process-level Optimization in Visual Reasoning via Reinforcement Learning
by: Jiang, Yanbei, et al.
Published: (2025)
FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning
by: Zhang, Lu, et al.
Published: (2025)
UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning
by: Bai, Sule, et al.
Published: (2025)
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
by: Li, Hao, et al.
Published: (2024)
Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning
by: Zhang, Zhenyu, et al.
Published: (2023)
Visual Reasoning through Tool-supervised Reinforcement Learning
by: Dong, Qihua, et al.
Published: (2026)
HOLO: Homography-Guided Pose Estimator Network for Fine-Grained Visual Localization on SD Maps
by: Zhong, Xuchang, et al.
Published: (2026)
ASPO: Adaptive Sentence-Level Preference Optimization for Fine-Grained Multimodal Reasoning
by: Wang, Yeyuan, et al.
Published: (2025)
Improving Medical Visual Reinforcement Fine-Tuning via Perception and Reasoning Augmentation
by: Yang, Guangjing, et al.
Published: (2026)
TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning
by: Greisinger, Christian, et al.
Published: (2026)
Lang2Act: Fine-Grained Visual Reasoning through Self-Emergent Linguistic Toolchains
by: Xiong, Yuqi, et al.
Published: (2026)
Guiding the Inner Eye: A Framework for Hierarchical and Flexible Visual Grounded Reasoning
by: Wei, Zhaoyang, et al.
Published: (2025)
R-AVST: Empowering Video-LLMs with Fine-Grained Spatio-Temporal Reasoning in Complex Audio-Visual Scenarios
by: Zhu, Lu, et al.
Published: (2025)
Fine-Grained VLM Fine-tuning via Latent Hierarchical Adapter Learning
by: Zhao, Yumiao, et al.
Published: (2025)
Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement
by: Huang, Xin, et al.
Published: (2026)
Fine-Grained Representation for Lane Topology Reasoning
by: Xu, Guoqing, et al.
Published: (2025)
VER-Bench: Evaluating MLLMs on Reasoning with Fine-Grained Visual Evidence
by: Qiang, Chenhui, et al.
Published: (2025)
Fine-Grained Instruction-Guided Graph Reasoning for Vision-and-Language Navigation
by: Liu, Yaohua, et al.
Published: (2025)
ReasonMap: Towards Fine-Grained Visual Reasoning from Transit Maps
by: Feng, Sicheng, et al.
Published: (2025)
Expert Knowledge-Guided Decision Calibration for Accurate Fine-Grained Tree Species Classification
by: Long, Chen, et al.
Published: (2026)
VistaGEN: Consistent Driving Video Generation with Fine-Grained Control Using Multiview Visual-Language Reasoning
by: Chen, Li-Heng, et al.
Published: (2026)
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ
by: Belouadi, Jonas, et al.
Published: (2023)
MeshArt: Generating Articulated Meshes with Structure-Guided Transformers
by: Gao, Daoyi, et al.
Published: (2024)
Uncertainty Guided Refinement for Fine-Grained Salient Object Detection
by: Yuan, Yao, et al.
Published: (2025)
Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning
by: Tang, Haomiao, et al.
Published: (2026)
AURA: A Fine-Grained Benchmark and Decomposed Metric for Audio-Visual Reasoning
by: Galougah, Siminfar Samakoush, et al.
Published: (2025)
Unveiling Fine-Grained Visual Traces: Evaluating Multimodal Interleaved Reasoning Chains in Multimodal STEM Tasks
by: Jin, Jing, et al.
Published: (2026)
ReAlign: Optimizing the Visual Document Retriever with Reasoning-Guided Fine-Grained Alignment
by: Yang, Hao, et al.
Published: (2026)
Discovering Fine-Grained Visual-Concept Relations by Disentangled Optimal Transport Concept Bottleneck Models
by: Xie, Yan, et al.
Published: (2025)
Grounded Reinforcement Learning for Visual Reasoning
by: Sarch, Gabriel, et al.
Published: (2025)
LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition
by: Shi, Jialu, et al.
Published: (2024)
Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning
by: Xia, Jiaer, et al.
Published: (2025)