Saved in:
| Main Authors: | Xu, Mengjie, Zhu, Yitao, Jiang, Haotian, Li, Jiaming, Shen, Zhenrong, Wang, Sheng, Huang, Haolin, Wang, Xinyu, Yang, Qing, Zhang, Han, Wang, Qian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.20111 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module
by: Wang, Xinyu, et al.
Published: (2024)
by: Wang, Xinyu, et al.
Published: (2024)
UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System
by: Zhu, Yitao, et al.
Published: (2025)
by: Zhu, Yitao, et al.
Published: (2025)
Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis
by: Zhu, Yitao, et al.
Published: (2025)
by: Zhu, Yitao, et al.
Published: (2025)
MeLo: Low-rank Adaptation is Better than Fine-tuning for Medical Image Diagnosis
by: Zhu, Yitao, et al.
Published: (2023)
by: Zhu, Yitao, et al.
Published: (2023)
MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction
by: Zhu, Yitao, et al.
Published: (2024)
by: Zhu, Yitao, et al.
Published: (2024)
LEGO: Learning and Graph-Optimized Modular Tracker for Online Multi-Object Tracking with Point Clouds
by: Zhang, Zhenrong, et al.
Published: (2023)
by: Zhang, Zhenrong, et al.
Published: (2023)
Offline-Poly: A Polyhedral Framework For Offline 3D Multi-Object Tracking
by: Li, Xiaoyu, et al.
Published: (2026)
by: Li, Xiaoyu, et al.
Published: (2026)
VisionCAD: An Integration-Free Radiology Copilot Framework
by: Li, Jiaming, et al.
Published: (2025)
by: Li, Jiaming, et al.
Published: (2025)
ReactDiff: Latent Diffusion for Facial Reaction Generation
by: Li, Jiaming, et al.
Published: (2025)
by: Li, Jiaming, et al.
Published: (2025)
MMOT: The First Challenging Benchmark for Drone-based Multispectral Multi-Object Tracking
by: Li, Tianhao, et al.
Published: (2025)
by: Li, Tianhao, et al.
Published: (2025)
Omnidirectional Multi-Object Tracking
by: Luo, Kai, et al.
Published: (2025)
by: Luo, Kai, et al.
Published: (2025)
LSA: Latent Style Augmentation Towards Stain-Agnostic Cervical Cancer Screening
by: Cai, Jiangdong, et al.
Published: (2025)
by: Cai, Jiangdong, et al.
Published: (2025)
Inter-slice Super-resolution of Magnetic Resonance Images by Pre-training and Self-supervised Fine-tuning
by: Wang, Xin, et al.
Published: (2024)
by: Wang, Xin, et al.
Published: (2024)
Pathology Image Restoration via Mixture of Prompts
by: Cai, Jiangdong, et al.
Published: (2025)
by: Cai, Jiangdong, et al.
Published: (2025)
Two-stage Cytopathological Image Synthesis for Augmenting Cervical Abnormality Screening
by: Shen, Zhenrong, et al.
Published: (2024)
by: Shen, Zhenrong, et al.
Published: (2024)
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation
by: Jiang, Junjie, et al.
Published: (2025)
by: Jiang, Junjie, et al.
Published: (2025)
OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
by: Qian, Zekun, et al.
Published: (2024)
by: Qian, Zekun, et al.
Published: (2024)
Adversarial Attack for RGB-Event based Visual Object Tracking
by: Chen, Qiang, et al.
Published: (2025)
by: Chen, Qiang, et al.
Published: (2025)
DeTrack: In-model Latent Denoising Learning for Visual Object Tracking
by: Zhou, Xinyu, et al.
Published: (2025)
by: Zhou, Xinyu, et al.
Published: (2025)
View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV
by: Ji, Deyi, et al.
Published: (2024)
by: Ji, Deyi, et al.
Published: (2024)
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
by: Liu, Zhijian, et al.
Published: (2022)
by: Liu, Zhijian, et al.
Published: (2022)
MambaEVT: Event Stream based Visual Object Tracking using State Space Model
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
SDSTrack: Self-Distillation Symmetric Adapter Learning for Multi-Modal Visual Object Tracking
by: Hou, Xiaojun, et al.
Published: (2024)
by: Hou, Xiaojun, et al.
Published: (2024)
Unified Sequence-to-Sequence Learning for Single- and Multi-Modal Visual Object Tracking
by: Chen, Xin, et al.
Published: (2023)
by: Chen, Xin, et al.
Published: (2023)
Spatial Orthogonal Refinement for Robust RGB-Event Visual Object Tracking
by: Huang, Dexing, et al.
Published: (2026)
by: Huang, Dexing, et al.
Published: (2026)
Decoupling Amplitude and Phase Attention in Frequency Domain for RGB-Event based Visual Object Tracking
by: Wang, Shiao, et al.
Published: (2026)
by: Wang, Shiao, et al.
Published: (2026)
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework
by: Li, Xiaoyu, et al.
Published: (2024)
by: Li, Xiaoyu, et al.
Published: (2024)
Dynamic Pondering Sparsity-aware Mixture-of-Experts Transformer for Event Stream based Visual Object Tracking
by: Wang, Shiao, et al.
Published: (2026)
by: Wang, Shiao, et al.
Published: (2026)
Cross-View Referring Multi-Object Tracking
by: Chen, Sijia, et al.
Published: (2024)
by: Chen, Sijia, et al.
Published: (2024)
ACTrack: Adding Spatio-Temporal Condition for Visual Object Tracking
by: Han, Yushan, et al.
Published: (2024)
by: Han, Yushan, et al.
Published: (2024)
SRRT: Exploring Search Region Regulation for Visual Object Tracking
by: Zhu, Jiawen, et al.
Published: (2022)
by: Zhu, Jiawen, et al.
Published: (2022)
OmniTrack++: Omnidirectional Multi-Object Tracking by Learning Large-FoV Trajectory Feedback
by: Luo, Kai, et al.
Published: (2025)
by: Luo, Kai, et al.
Published: (2025)
Visual Object Tracking on Multi-modal RGB-D Videos: A Review
by: Zhu, Xue-Feng, et al.
Published: (2022)
by: Zhu, Xue-Feng, et al.
Published: (2022)
Multi-step Temporal Modeling for UAV Tracking
by: Yuan, Xiaoying, et al.
Published: (2024)
by: Yuan, Xiaoying, et al.
Published: (2024)
Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening
by: Kong, Yan, et al.
Published: (2024)
by: Kong, Yan, et al.
Published: (2024)
AerialMind: Towards Referring Multi-Object Tracking in UAV Scenarios
by: Chen, Chenglizhao, et al.
Published: (2025)
by: Chen, Chenglizhao, et al.
Published: (2025)
Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
MSITrack: A Challenging Benchmark for Multispectral Single Object Tracking
by: Feng, Tao, et al.
Published: (2025)
by: Feng, Tao, et al.
Published: (2025)
STORM: End-to-End Referring Multi-Object Tracking in Videos
by: Lu, Zijia, et al.
Published: (2026)
by: Lu, Zijia, et al.
Published: (2026)
Similar Items
-
DCIM-AVSR : Efficient Audio-Visual Speech Recognition via Dual Conformer Interaction Module
by: Wang, Xinyu, et al.
Published: (2024) -
UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System
by: Zhu, Yitao, et al.
Published: (2025) -
Med-LEGO: Editing and Adapting toward Generalist Medical Image Diagnosis
by: Zhu, Yitao, et al.
Published: (2025) -
MeLo: Low-rank Adaptation is Better than Fine-tuning for Medical Image Diagnosis
by: Zhu, Yitao, et al.
Published: (2023) -
MUC: Mixture of Uncalibrated Cameras for Robust 3D Human Body Reconstruction
by: Zhu, Yitao, et al.
Published: (2024)