| Main Authors: | Xue, Yu, Qu, Haoxuan, Li, Zhuoling, Lou, Yihang, Bai, Yan, Rahmani, Hossein, Liu, Jun |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.02518 |
Similar Items
DisC-GS: Discontinuity-aware Gaussian Splatting
by: Qu, Haoxuan, et al.
Published: (2024)
An Image-like Diffusion Method for Human-Object Interaction Detection
by: Hui, Xiaofei, et al.
Published: (2025)
LongDiff: Training-Free Long Video Generation in One Go
by: Li, Zhuoling, et al.
Published: (2025)
MicroscopyMatching: Towards a Ready-to-use Framework for Microscopy Image Analysis in Diverse Conditions
by: Hui, Xiaofei, et al.
Published: (2026)
Recent Advances of Continual Learning in Computer Vision: An Overview
by: Qu, Haoxuan, et al.
Published: (2021)
Translating Signals to Languages for sEMG-Based Activity Recognition
by: Wang, Ming, et al.
Published: (2026)
When Visual Privacy Protection Meets Multimodal Large Language Models
by: Hui, Xiaofei, et al.
Published: (2026)
A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction
by: Qu, Haoxuan, et al.
Published: (2025)
TIGER-FG: Text-Guided Implicit Fine-Grained Grounding for E-commerce Retrieval
by: Sun, Xinyu, et al.
Published: (2026)
FG$^2$: Fine-Grained Cross-View Localization by Fine-Grained Feature Matching
by: Xia, Zimin, et al.
Published: (2025)
Learning to Generate Cross-Task Unexploitable Examples
by: Qu, Haoxuan, et al.
Published: (2025)
FG-CLIP: Fine-Grained Visual and Textual Alignment
by: Xie, Chunyu, et al.
Published: (2025)
Hierarchical Contextual Grounding LVLM: Enhancing Fine-Grained Visual-Language Understanding with Robust Grounding
by: Guo, Leilei, et al.
Published: (2025)
TSTMotion: Training-free Scene-aware Text-to-motion Generation
by: Guo, Ziyan, et al.
Published: (2025)
Bayesian Evidential Learning for Few-Shot Classification
by: Linghu, Xiongkun, et al.
Published: (2022)
FG-Attn: Leveraging Fine-Grained Sparsity In Diffusion Transformers
by: Durvasula, Sankeerth, et al.
Published: (2025)
FG-SGL: Fine-Grained Semantic Guidance Learning via Motion Process Decomposition for Micro-Gesture Recognition
by: Wei, Jinsheng, et al.
Published: (2026)
LLM-based Hierarchical Concept Decomposition for Interpretable Fine-Grained Image Classification
by: Qu, Renyi, et al.
Published: (2024)
Action Detection via an Image Diffusion Process
by: Foo, Lin Geng, et al.
Published: (2024)
Unveiling Fine-Grained Visual Traces: Evaluating Multimodal Interleaved Reasoning Chains in Multimodal STEM Tasks
by: Jin, Jing, et al.
Published: (2026)
Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization
by: Zhou, Feixiang, et al.
Published: (2024)
CGC: Compositional Grounded Contrast for Fine-Grained Multi-Image Understanding
by: Zheng, Lihao, et al.
Published: (2026)
Proto-FG3D: Prototype-based Interpretable Fine-Grained 3D Shape Classification
by: Ma, Shuxian, et al.
Published: (2025)
Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers
by: Zhang, Zhengbo, et al.
Published: (2024)
AI-Generated Content (AIGC) for Various Data Modalities: A Survey
by: Foo, Lin Geng, et al.
Published: (2023)
AFiRe: Anatomy-Driven Self-Supervised Learning for Fine-Grained Representation in Radiographic Images
by: Liu, Yihang, et al.
Published: (2025)
LLMs are Good Action Recognizers
by: Qu, Haoxuan, et al.
Published: (2024)
GPT-Connect: Interaction between Text-Driven Human Motion Generator and 3D Scenes in a Training-free Manner
by: Qu, Haoxuan, et al.
Published: (2024)
HMIL: Hierarchical Multi-Instance Learning for Fine-Grained Whole Slide Image Classification
by: Jin, Cheng, et al.
Published: (2024)
SceneLLM: Implicit Language Reasoning in LLM for Dynamic Scene Graph Generation
by: Zhang, Hang, et al.
Published: (2024)
Towards Fine-Grained Recognition with Large Visual Language Models: Benchmark and Optimization Strategies
by: Pang, Cong, et al.
Published: (2025)
Federated CLIP for Resource-Efficient Heterogeneous Medical Image Classification
by: Wu, Yihang, et al.
Published: (2025)
Fine-Grained ImageNet Classification in the Wild
by: Lymperaiou, Maria, et al.
Published: (2023)
Generalizable Whole Slide Image Classification with Fine-Grained Visual-Semantic Interaction
by: Li, Hao, et al.
Published: (2024)
Fine-Grained Scene Image Classification with Modality-Agnostic Adapter
by: Wang, Yiqun, et al.
Published: (2024)
Towards Unified 3D Object Detection via Algorithm and Data Unification
by: Li, Zhuoling, et al.
Published: (2024)
Towards a Transparent and Interpretable AI Model for Medical Image Classifications
by: Wen, Binbin, et al.
Published: (2025)
CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing
by: Li, Yan, et al.
Published: (2025)
Toward Unified Fine-Grained Vehicle Classification and Automatic License Plate Recognition
by: Lima, Gabriel E., et al.
Published: (2026)
Towards Privacy-Preserving Fine-Grained Visual Classification via Hierarchical Learning from Label Proportions
by: Chang, Jinyi, et al.
Published: (2025)