Saved in:
| Main Authors: | Liu, Yu, Mahmood, Arif, Khan, Muhammad Haris |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.20421 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Depth Attention for Robust RGB Tracking
by: Liu, Yu, et al.
Published: (2024)
by: Liu, Yu, et al.
Published: (2024)
Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels
by: Marrium, Maria, et al.
Published: (2024)
by: Marrium, Maria, et al.
Published: (2024)
EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark
by: Zhang, Deheng, et al.
Published: (2025)
by: Zhang, Deheng, et al.
Published: (2025)
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
by: Zhang, Chunhui, et al.
Published: (2024)
by: Zhang, Chunhui, et al.
Published: (2024)
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
by: Tourani, Siddharth, et al.
Published: (2024)
by: Tourani, Siddharth, et al.
Published: (2024)
microCLIP: Unsupervised CLIP Adaptation via Coarse-Fine Token Fusion for Fine-Grained Image Classification
by: Silva, Sathira, et al.
Published: (2025)
by: Silva, Sathira, et al.
Published: (2025)
Unsupervised Deep Graph Matching Based on Cycle Consistency
by: Tourani, Siddharth, et al.
Published: (2023)
by: Tourani, Siddharth, et al.
Published: (2023)
A Comprehensive Survey on Deep Learning Solutions for 3D Flood Mapping
by: Jia, Wenfeng, et al.
Published: (2025)
by: Jia, Wenfeng, et al.
Published: (2025)
Predicting the Best of N Visual Trackers
by: Alawode, Basit, et al.
Published: (2024)
by: Alawode, Basit, et al.
Published: (2024)
Event Stream-based Visual Object Tracking: HDETrack V2 and A High-Definition Benchmark
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
Serial Over Parallel: Learning Continual Unification for Multi-Modal Visual Object Tracking and Benchmarking
by: Tang, Zhangyong, et al.
Published: (2025)
by: Tang, Zhangyong, et al.
Published: (2025)
Adversarial Attacks on Audio Deepfake Detection: A Benchmark and Comparative Study
by: Uddin, Kutub, et al.
Published: (2025)
by: Uddin, Kutub, et al.
Published: (2025)
MPT: A Large-scale Multi-Phytoplankton Tracking Benchmark
by: Yu, Yang, et al.
Published: (2024)
by: Yu, Yang, et al.
Published: (2024)
Visual Object Tracking across Diverse Data Modalities: A Review
by: Wang, Mengmeng, et al.
Published: (2024)
by: Wang, Mengmeng, et al.
Published: (2024)
Waste-Bench: A Comprehensive Benchmark for Evaluating VLLMs in Cluttered Environments
by: Ali, Muhammad, et al.
Published: (2025)
by: Ali, Muhammad, et al.
Published: (2025)
OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking
by: Qian, Zekun, et al.
Published: (2024)
by: Qian, Zekun, et al.
Published: (2024)
Are We Making Progress in Multimodal Domain Generalization? A Comprehensive Benchmark Study
by: Dong, Hao, et al.
Published: (2026)
by: Dong, Hao, et al.
Published: (2026)
LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset
by: Aithal, Manjushree, et al.
Published: (2025)
by: Aithal, Manjushree, et al.
Published: (2025)
RELO: Reinforcement Learning to Localize for Visual Object Tracking
by: Chen, Xin, et al.
Published: (2026)
by: Chen, Xin, et al.
Published: (2026)
MITracker: Multi-View Integration for Visual Object Tracking
by: Xu, Mengjie, et al.
Published: (2025)
by: Xu, Mengjie, et al.
Published: (2025)
ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes
by: Malik, Hashmat Shadab, et al.
Published: (2024)
by: Malik, Hashmat Shadab, et al.
Published: (2024)
ATSTrack: Enhancing Visual-Language Tracking by Aligning Temporal and Spatial Scales
by: Zhen, Yihao, et al.
Published: (2025)
by: Zhen, Yihao, et al.
Published: (2025)
AgroVG: A Large-Scale Multi-Source Benchmark for Agricultural Visual Grounding
by: Li, Haocheng, et al.
Published: (2026)
by: Li, Haocheng, et al.
Published: (2026)
Adversarial Attack for RGB-Event based Visual Object Tracking
by: Chen, Qiang, et al.
Published: (2025)
by: Chen, Qiang, et al.
Published: (2025)
DRMOT: A Dataset and Framework for RGBD Referring Multi-Object Tracking
by: Chen, Sijia, et al.
Published: (2026)
by: Chen, Sijia, et al.
Published: (2026)
Tell me Habibi, is it Real or Fake?
by: Kuckreja, Kartik, et al.
Published: (2025)
by: Kuckreja, Kartik, et al.
Published: (2025)
LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models
by: Liao, Pan, et al.
Published: (2026)
by: Liao, Pan, et al.
Published: (2026)
ReefNet: A Large-Scale Dataset and Benchmark for Fine-Grained Coral Reef Recognition
by: Felemban, Abdulwahab, et al.
Published: (2025)
by: Felemban, Abdulwahab, et al.
Published: (2025)
Cross-View Referring Multi-Object Tracking
by: Chen, Sijia, et al.
Published: (2024)
by: Chen, Sijia, et al.
Published: (2024)
Long-Term Visual Object Tracking with Event Cameras: An Associative Memory Augmented Tracker and A Benchmark Dataset
by: Wang, Xiao, et al.
Published: (2024)
by: Wang, Xiao, et al.
Published: (2024)
FastTrackTr:Towards Fast Multi-Object Tracking with Transformers
by: Liao, Pan, et al.
Published: (2024)
by: Liao, Pan, et al.
Published: (2024)
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis
by: Alawode, Basit, et al.
Published: (2025)
by: Alawode, Basit, et al.
Published: (2025)
Awesome Multi-modal Object Tracking
by: Zhang, Chunhui, et al.
Published: (2024)
by: Zhang, Chunhui, et al.
Published: (2024)
TAO-Amodal: A Benchmark for Tracking Any Object Amodally
by: Hsieh, Cheng-Yen, et al.
Published: (2023)
by: Hsieh, Cheng-Yen, et al.
Published: (2023)
MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage
by: Khan, Ufaq, et al.
Published: (2026)
by: Khan, Ufaq, et al.
Published: (2026)
Real-Time Object Detection in Occluded Environment with Background Cluttering Effects Using Deep Learning
by: Aamir, Syed Muhammad, et al.
Published: (2024)
by: Aamir, Syed Muhammad, et al.
Published: (2024)
Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology
by: Ji, Yatai, et al.
Published: (2025)
by: Ji, Yatai, et al.
Published: (2025)
Towards Low-Latency Event Stream-based Visual Object Tracking: A Slow-Fast Approach
by: Wang, Shiao, et al.
Published: (2025)
by: Wang, Shiao, et al.
Published: (2025)
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis
by: Wei, Jianhui, et al.
Published: (2025)
by: Wei, Jianhui, et al.
Published: (2025)
Multi-modal Medical Image Fusion For Non-Small Cell Lung Cancer Classification
by: Hassan, Salma, et al.
Published: (2024)
by: Hassan, Salma, et al.
Published: (2024)
Similar Items
-
Depth Attention for Robust RGB Tracking
by: Liu, Yu, et al.
Published: (2024) -
Implicit to Explicit Entropy Regularization: Benchmarking ViT Fine-tuning under Noisy Labels
by: Marrium, Maria, et al.
Published: (2024) -
EgoNight: Towards Egocentric Vision Understanding at Night with a Challenging Benchmark
by: Zhang, Deheng, et al.
Published: (2025) -
WebUOT-1M: Advancing Deep Underwater Object Tracking with A Million-Scale Benchmark
by: Zhang, Chunhui, et al.
Published: (2024) -
Pose-Guided Self-Training with Two-Stage Clustering for Unsupervised Landmark Discovery
by: Tourani, Siddharth, et al.
Published: (2024)