Saved in:
| Main Authors: | Zhang, Xufei, Zhou, Xinjiao, Deng, Ziling, Geng, Dongdong, Wang, Jianxiong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.03530 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification
by: Qu, Renyi, et al.
Published: (2024)
by: Qu, Renyi, et al.
Published: (2024)
UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
by: Zhou, Yue, et al.
Published: (2025)
by: Zhou, Yue, et al.
Published: (2025)
Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification
by: Gao, Yibo, et al.
Published: (2025)
by: Gao, Yibo, et al.
Published: (2025)
ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction Tuning
by: Deng, Pei, et al.
Published: (2024)
by: Deng, Pei, et al.
Published: (2024)
AD-DINOv3: Enhancing DINOv3 for Zero-Shot Anomaly Detection with Anomaly-Aware Calibration
by: Yuan, Jingyi, et al.
Published: (2025)
by: Yuan, Jingyi, et al.
Published: (2025)
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning
by: Chen, Ziwei, et al.
Published: (2025)
by: Chen, Ziwei, et al.
Published: (2025)
Weather-R1: Logically Consistent Reinforcement Fine-Tuning for Multimodal Reasoning in Meteorology
by: Wu, Kaiyu, et al.
Published: (2026)
by: Wu, Kaiyu, et al.
Published: (2026)
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning
by: Zhu, Liyun, et al.
Published: (2025)
by: Zhu, Liyun, et al.
Published: (2025)
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
by: He, Junwen, et al.
Published: (2024)
by: He, Junwen, et al.
Published: (2024)
DEFT: Decompositional Efficient Fine-Tuning for Text-to-Image Models
by: Kumar, Komal, et al.
Published: (2025)
by: Kumar, Komal, et al.
Published: (2025)
A High-Quality Text-Rich Image Instruction Tuning Dataset via Hybrid Instruction Generation
by: Zhou, Shijie, et al.
Published: (2024)
by: Zhou, Shijie, et al.
Published: (2024)
Reinforcement Fine-Tuning Powers Reasoning Capability of Multimodal Large Language Models
by: Sun, Haoyuan, et al.
Published: (2025)
by: Sun, Haoyuan, et al.
Published: (2025)
Streaming Video Instruction Tuning
by: Xia, Jiaer, et al.
Published: (2025)
by: Xia, Jiaer, et al.
Published: (2025)
Component-aware Unsupervised Logical Anomaly Generation for Industrial Anomaly Detection
by: Tong, Xuan, et al.
Published: (2025)
by: Tong, Xuan, et al.
Published: (2025)
MotionRFT: Unified Reinforcement Fine-Tuning for Text-to-Motion Generation
by: Tan, Xiaofeng, et al.
Published: (2026)
by: Tan, Xiaofeng, et al.
Published: (2026)
COFT-AD: COntrastive Fine-Tuning for Few-Shot Anomaly Detection
by: Liao, Jingyi, et al.
Published: (2024)
by: Liao, Jingyi, et al.
Published: (2024)
RB-FT: Rationale-Bootstrapped Fine-Tuning for Video Classification
by: Xu, Meilong, et al.
Published: (2025)
by: Xu, Meilong, et al.
Published: (2025)
Learning A Multi-Task Transformer Via Unified And Customized Instruction Tuning For Chest Radiograph Interpretation
by: Xu, Lijian, et al.
Published: (2023)
by: Xu, Lijian, et al.
Published: (2023)
InstructAttribute: Fine-grained Object Attributes editing with Instruction
by: Yin, Xingxi, et al.
Published: (2025)
by: Yin, Xingxi, et al.
Published: (2025)
Multimodal Instruction Tuning with Hybrid State Space Models
by: Zhou, Jianing, et al.
Published: (2024)
by: Zhou, Jianing, et al.
Published: (2024)
LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning
by: Shi, Yiming, et al.
Published: (2024)
by: Shi, Yiming, et al.
Published: (2024)
Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization
by: Deng, Hanqiu, et al.
Published: (2023)
by: Deng, Hanqiu, et al.
Published: (2023)
Personalized Visual Instruction Tuning
by: Pi, Renjie, et al.
Published: (2024)
by: Pi, Renjie, et al.
Published: (2024)
Gradient-based Parameter Selection for Efficient Fine-Tuning
by: Zhang, Zhi, et al.
Published: (2023)
by: Zhang, Zhi, et al.
Published: (2023)
Visual Instruction Tuning with Chain of Region-of-Interest
by: Chen, Yixin, et al.
Published: (2025)
by: Chen, Yixin, et al.
Published: (2025)
VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning
by: Wang, Qi, et al.
Published: (2025)
by: Wang, Qi, et al.
Published: (2025)
Human Motion Instruction Tuning
by: Li, Lei, et al.
Published: (2024)
by: Li, Lei, et al.
Published: (2024)
UniADC: A Unified Framework for Anomaly Detection and Classification
by: Zhang, Ximiao, et al.
Published: (2025)
by: Zhang, Ximiao, et al.
Published: (2025)
PanAdapter: Two-Stage Fine-Tuning with Spatial-Spectral Priors Injecting for Pansharpening
by: Wu, RuoCheng, et al.
Published: (2024)
by: Wu, RuoCheng, et al.
Published: (2024)
Semantic Hierarchical Prompt Tuning for Parameter-Efficient Fine-Tuning
by: Zhu, Haowei, et al.
Published: (2024)
by: Zhu, Haowei, et al.
Published: (2024)
VLN-R1: Vision-Language Navigation via Reinforcement Fine-Tuning
by: Qi, Zhangyang, et al.
Published: (2025)
by: Qi, Zhangyang, et al.
Published: (2025)
DiffusionPID: Interpreting Diffusion via Partial Information Decomposition
by: Zawar, Rushikesh, et al.
Published: (2024)
by: Zawar, Rushikesh, et al.
Published: (2024)
PeftCD: Leveraging Vision Foundation Models with Parameter-Efficient Fine-Tuning for Remote Sensing Change Detection
by: Dong, Sijun, et al.
Published: (2025)
by: Dong, Sijun, et al.
Published: (2025)
Interpretable Multimodal Out-of-context Detection with Soft Logic Regularization
by: Ma, Huanhuan, et al.
Published: (2024)
by: Ma, Huanhuan, et al.
Published: (2024)
What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Instruction Tuning
by: Du, Yifan, et al.
Published: (2023)
by: Du, Yifan, et al.
Published: (2023)
Parameter-Efficient Fine-Tuning of Multispectral Foundation Models for Hyperspectral Image Classification
by: Ligan, Bernardin, et al.
Published: (2025)
by: Ligan, Bernardin, et al.
Published: (2025)
Deep Instruction Tuning for Segment Anything Model
by: Huang, Xiaorui, et al.
Published: (2024)
by: Huang, Xiaorui, et al.
Published: (2024)
CoFInAl: Enhancing Action Quality Assessment with Coarse-to-Fine Instruction Alignment
by: Zhou, Kanglei, et al.
Published: (2024)
by: Zhou, Kanglei, et al.
Published: (2024)
Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance
by: Li, Songze, et al.
Published: (2025)
by: Li, Songze, et al.
Published: (2025)
Semi-Supervised Fine-Tuning of Vision Foundation Models with Content-Style Decomposition
by: Drozdova, Mariia, et al.
Published: (2024)
by: Drozdova, Mariia, et al.
Published: (2024)
Similar Items
-
LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification
by: Qu, Renyi, et al.
Published: (2024) -
UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
by: Zhou, Yue, et al.
Published: (2025) -
Learning Concept-Driven Logical Rules for Interpretable and Generalizable Medical Image Classification
by: Gao, Yibo, et al.
Published: (2025) -
ChangeChat: An Interactive Model for Remote Sensing Change Analysis via Multimodal Instruction Tuning
by: Deng, Pei, et al.
Published: (2024) -
AD-DINOv3: Enhancing DINOv3 for Zero-Shot Anomaly Detection with Anomaly-Aware Calibration
by: Yuan, Jingyi, et al.
Published: (2025)