Saved in:
| Main Authors: | Lao, Danning, Liu, Qi, Bu, Jiazi, Yan, Junchi, Shen, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.17050 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Data-free Knowledge Distillation for Fine-grained Visual Categorization
by: Shao, Renrong, et al.
Published: (2024)
by: Shao, Renrong, et al.
Published: (2024)
Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
by: Liu, Yu, et al.
Published: (2024)
by: Liu, Yu, et al.
Published: (2024)
Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization
by: Bi, Qi, et al.
Published: (2024)
by: Bi, Qi, et al.
Published: (2024)
Concept-wise Attention for Fine-grained Concept Bottleneck Models
by: Zhong, Minghong, et al.
Published: (2026)
by: Zhong, Minghong, et al.
Published: (2026)
SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition
by: Yang, Jingxiao, et al.
Published: (2026)
by: Yang, Jingxiao, et al.
Published: (2026)
Uni-Classifier: Leveraging Video Diffusion Priors for Universal Guidance Classifier
by: Zhou, Yujie, et al.
Published: (2026)
by: Zhou, Yujie, et al.
Published: (2026)
Boosting Order-Preserving and Transferability for Neural Architecture Search: a Joint Architecture Refined Search and Fine-tuning Approach
by: Zhang, Beichen, et al.
Published: (2024)
by: Zhang, Beichen, et al.
Published: (2024)
Fine-Grained GRPO for Precise Preference Alignment in Flow Models
by: Zhou, Yujie, et al.
Published: (2025)
by: Zhou, Yujie, et al.
Published: (2025)
Car-1000: A New Large Scale Fine-Grained Visual Categorization Dataset
by: Hu, Yutao, et al.
Published: (2025)
by: Hu, Yutao, et al.
Published: (2025)
SwimVG: Step-wise Multimodal Fusion and Adaption for Visual Grounding
by: Shi, Liangtao, et al.
Published: (2025)
by: Shi, Liangtao, et al.
Published: (2025)
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
by: Xu, Ran, et al.
Published: (2024)
by: Xu, Ran, et al.
Published: (2024)
Towards Low-latency Event-based Visual Recognition with Hybrid Step-wise Distillation Spiking Neural Networks
by: Zhong, Xian, et al.
Published: (2024)
by: Zhong, Xian, et al.
Published: (2024)
SoLA-Vision: Fine-grained Layer-wise Linear Softmax Hybrid Attention
by: Li, Ruibang, et al.
Published: (2026)
by: Li, Ruibang, et al.
Published: (2026)
Fast and Interpretable 2D Homography Decomposition: Similarity-Kernel-Similarity and Affine-Core-Affine Transformations
by: Cai, Shen, et al.
Published: (2024)
by: Cai, Shen, et al.
Published: (2024)
Context-Semantic Quality Awareness Network for Fine-Grained Visual Categorization
by: Xu, Qin, et al.
Published: (2024)
by: Xu, Qin, et al.
Published: (2024)
ViGG: Robust RGB-D Point Cloud Registration using Visual-Geometric Mutual Guidance
by: Chen, Congjia, et al.
Published: (2025)
by: Chen, Congjia, et al.
Published: (2025)
SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
by: Zhang, Jiacheng, et al.
Published: (2026)
by: Zhang, Jiacheng, et al.
Published: (2026)
FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions
by: Zhao, Peisen, et al.
Published: (2026)
by: Zhao, Peisen, et al.
Published: (2026)
Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation
by: Duggal, Shivam, et al.
Published: (2025)
by: Duggal, Shivam, et al.
Published: (2025)
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
by: He, Junwen, et al.
Published: (2024)
by: He, Junwen, et al.
Published: (2024)
DiCache: Let Diffusion Model Determine Its Own Cache
by: Bu, Jiazi, et al.
Published: (2025)
by: Bu, Jiazi, et al.
Published: (2025)
Unified Personalized Reward Model for Vision Generation
by: Wang, Yibin, et al.
Published: (2026)
by: Wang, Yibin, et al.
Published: (2026)
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning
by: Wang, Yuxuan, et al.
Published: (2024)
by: Wang, Yuxuan, et al.
Published: (2024)
Concept Drift and Long-Tailed Distribution in Fine-Grained Visual Categorization: Benchmark and Method
by: Ye, Shuo, et al.
Published: (2023)
by: Ye, Shuo, et al.
Published: (2023)
Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards
by: Chen, Honghao, et al.
Published: (2025)
by: Chen, Honghao, et al.
Published: (2025)
Grounding and Enhancing Grid-based Models for Neural Fields
by: Zhao, Zelin, et al.
Published: (2024)
by: Zhao, Zelin, et al.
Published: (2024)
ViSurf: Visual Supervised-and-Reinforcement Fine-Tuning for Large Vision-and-Language Models
by: Liu, Yuqi, et al.
Published: (2025)
by: Liu, Yuqi, et al.
Published: (2025)
Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions
by: Wu, Tianxu, et al.
Published: (2023)
by: Wu, Tianxu, et al.
Published: (2023)
CausalFSFG: Rethinking Few-Shot Fine-Grained Visual Categorization from Causal Perspective
by: Yang, Zhiwen, et al.
Published: (2025)
by: Yang, Zhiwen, et al.
Published: (2025)
FineRMoE: Dimension Expansion for Finer-Grained Expert with Its Upcycling Approach
by: Liao, Ning, et al.
Published: (2026)
by: Liao, Ning, et al.
Published: (2026)
MLAE: Masked LoRA Experts for Visual Parameter-Efficient Fine-Tuning
by: Wang, Junjie, et al.
Published: (2024)
by: Wang, Junjie, et al.
Published: (2024)
Democratizing Fine-grained Visual Recognition with Large Language Models
by: Liu, Mingxuan, et al.
Published: (2024)
by: Liu, Mingxuan, et al.
Published: (2024)
FriendsQA: A New Large-Scale Deep Video Understanding Dataset with Fine-grained Topic Categorization for Story Videos
by: Wu, Zhengqian, et al.
Published: (2024)
by: Wu, Zhengqian, et al.
Published: (2024)
ViSpeak: Visual Instruction Feedback in Streaming Videos
by: Fu, Shenghao, et al.
Published: (2025)
by: Fu, Shenghao, et al.
Published: (2025)
Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
by: Pu, Yifan, et al.
Published: (2024)
by: Pu, Yifan, et al.
Published: (2024)
DRM: Diffusion-based Reward Model With Step-wise Guidance
by: Zhang, Jaxon, et al.
Published: (2026)
by: Zhang, Jaxon, et al.
Published: (2026)
Sub-token ViT Embedding via Stochastic Resonance Transformers
by: Lao, Dong, et al.
Published: (2023)
by: Lao, Dong, et al.
Published: (2023)
Learning Contrastive Self-Distillation for Ultra-Fine-Grained Visual Categorization Targeting Limited Samples
by: Fang, Ziye, et al.
Published: (2023)
by: Fang, Ziye, et al.
Published: (2023)
ViGoR: Improving Visual Grounding of Large Vision Language Models with Fine-Grained Reward Modeling
by: Yan, Siming, et al.
Published: (2024)
by: Yan, Siming, et al.
Published: (2024)
ViEEG: Hierarchical Visual Neural Representation for EEG Brain Decoding
by: Liu, Minxu, et al.
Published: (2025)
by: Liu, Minxu, et al.
Published: (2025)
Similar Items
-
Data-free Knowledge Distillation for Fine-grained Visual Categorization
by: Shao, Renrong, et al.
Published: (2024) -
Novel Class Discovery for Ultra-Fine-Grained Visual Categorization
by: Liu, Yu, et al.
Published: (2024) -
Cross-Level Multi-Instance Distillation for Self-Supervised Fine-Grained Visual Categorization
by: Bi, Qi, et al.
Published: (2024) -
Concept-wise Attention for Fine-grained Concept Bottleneck Models
by: Zhong, Minghong, et al.
Published: (2026) -
SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition
by: Yang, Jingxiao, et al.
Published: (2026)