Saved in:
| Main Authors: | Fadaei, Amir Hosein, Dehaqani, Mohammad-Reza A. |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2311.00800 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification
by: Ho, Darryl, et al.
Published: (2025)
by: Ho, Darryl, et al.
Published: (2025)
TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition
by: Hassan, Imtiaz Ul, et al.
Published: (2026)
by: Hassan, Imtiaz Ul, et al.
Published: (2026)
High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery
by: Peng, Hongxing, et al.
Published: (2025)
by: Peng, Hongxing, et al.
Published: (2025)
PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views
by: Barhdadi, Mohamed Rayan, et al.
Published: (2025)
by: Barhdadi, Mohamed Rayan, et al.
Published: (2025)
CG-HOI: Contact-Guided 3D Human-Object Interaction Generation
by: Diller, Christian, et al.
Published: (2023)
by: Diller, Christian, et al.
Published: (2023)
Deep Learning-based Depth Estimation Methods from Monocular Image and Videos: A Comprehensive Survey
by: Rajapaksha, Uchitha, et al.
Published: (2024)
by: Rajapaksha, Uchitha, et al.
Published: (2024)
Adapting SAM with Dynamic Similarity Graphs for Few-Shot Parameter-Efficient Small Dense Object Detection: A Case Study of Chickpea Pods in Field Conditions
by: Jiang, Xintong, et al.
Published: (2025)
by: Jiang, Xintong, et al.
Published: (2025)
Sign language recognition based on deep learning and low-cost handcrafted descriptors
by: Carneiro, Alvaro Leandro Cavalcante, et al.
Published: (2024)
by: Carneiro, Alvaro Leandro Cavalcante, et al.
Published: (2024)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)
by: Raoufi, Behnam, et al.
Published: (2025)
Capacity Constraint Analysis Using Object Detection for Smart Manufacturing
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)
SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)
FutureHuman3D: Forecasting Complex Long-Term 3D Human Behavior from Video Observations
by: Diller, Christian, et al.
Published: (2022)
by: Diller, Christian, et al.
Published: (2022)
Decoupling Vision and Language: Codebook Anchored Visual Adaptation
by: Wu, Jason, et al.
Published: (2026)
by: Wu, Jason, et al.
Published: (2026)
FlowDet: Overcoming Perspective and Scale Challenges in Real-Time End-to-End Traffic Detection
by: Wang, Zixing, et al.
Published: (2025)
by: Wang, Zixing, et al.
Published: (2025)
UGOD: Uncertainty-Guided Differentiable Opacity and Soft Dropout for Enhanced Sparse-View 3DGS
by: Guo, Zhihao, et al.
Published: (2025)
by: Guo, Zhihao, et al.
Published: (2025)
CARScenes: Semantic VLM Dataset for Safe Autonomous Driving
by: He, Yuankai, et al.
Published: (2025)
by: He, Yuankai, et al.
Published: (2025)
Detecting AI-Generated Videos with Spiking Neural Networks
by: Jang, Minsuk, et al.
Published: (2026)
by: Jang, Minsuk, et al.
Published: (2026)
Butter: Frequency Consistency and Hierarchical Fusion for Autonomous Driving Object Detection
by: Lin, Xiaojian, et al.
Published: (2025)
by: Lin, Xiaojian, et al.
Published: (2025)
Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis
by: Korolkov, Vasilii
Published: (2025)
by: Korolkov, Vasilii
Published: (2025)
SpectralCA: Bi-Directional Cross-Attention for Next-Generation UAV Hyperspectral Vision
by: Brovko, D. V.
Published: (2025)
by: Brovko, D. V.
Published: (2025)
Object detection in adverse weather conditions for autonomous vehicles using Instruct Pix2Pix
by: Gurbindo, Unai, et al.
Published: (2025)
by: Gurbindo, Unai, et al.
Published: (2025)
Implementing Adaptations for Vision AutoRegressive Model
by: Shaikh, Kaif, et al.
Published: (2025)
by: Shaikh, Kaif, et al.
Published: (2025)
Prompt Sensitivity in Vision-Language Grounding: How Small Changes in Wording Affect Object Detection
by: Deka, Dawar Jyoti, et al.
Published: (2026)
by: Deka, Dawar Jyoti, et al.
Published: (2026)
IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion
by: Mishra, Shashank, et al.
Published: (2025)
by: Mishra, Shashank, et al.
Published: (2025)
LiftAvatar: Kinematic-Space Completion for Expression-Controlled 3D Gaussian Avatar Animation
by: Wei, Hualiang, et al.
Published: (2026)
by: Wei, Hualiang, et al.
Published: (2026)
SelvaMask: Segmenting Trees in Tropical Forests and Beyond
by: Duguay, Simon-Olivier, et al.
Published: (2026)
by: Duguay, Simon-Olivier, et al.
Published: (2026)
Salient Concept-Aware Generative Data Augmentation
by: Zhao, Tianchen, et al.
Published: (2025)
by: Zhao, Tianchen, et al.
Published: (2025)
A deep learning approach to track eye movements based on events
by: Seth, Chirag, et al.
Published: (2025)
by: Seth, Chirag, et al.
Published: (2025)
Domain-Adaptive Pretraining Improves Primate Behavior Recognition
by: Mueller, Felix B., et al.
Published: (2025)
by: Mueller, Felix B., et al.
Published: (2025)
Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
by: Wang, Yiming, et al.
Published: (2026)
by: Wang, Yiming, et al.
Published: (2026)
Mistake Attribution: Fine-Grained Mistake Understanding in Egocentric Videos
by: Li, Yayuan, et al.
Published: (2025)
by: Li, Yayuan, et al.
Published: (2025)
SelvaBox: A high-resolution dataset for tropical tree crown detection
by: Baudchon, Hugo, et al.
Published: (2025)
by: Baudchon, Hugo, et al.
Published: (2025)
DeltaVLM: Interactive Remote Sensing Image Change Analysis via Instruction-guided Difference Perception
by: Deng, Pei, et al.
Published: (2025)
by: Deng, Pei, et al.
Published: (2025)
Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
by: Salazar, Jorge Yero, et al.
Published: (2024)
by: Salazar, Jorge Yero, et al.
Published: (2024)
Dense Motion Captioning
by: Xu, Shiyao, et al.
Published: (2025)
by: Xu, Shiyao, et al.
Published: (2025)
TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading
by: Lee, Byung Hoon, et al.
Published: (2025)
by: Lee, Byung Hoon, et al.
Published: (2025)
CoMatcher: Multi-View Collaborative Feature Matching
by: Zhang, Jintao, et al.
Published: (2025)
by: Zhang, Jintao, et al.
Published: (2025)
MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition
by: Fan, Qiannan, et al.
Published: (2025)
by: Fan, Qiannan, et al.
Published: (2025)
NeuroGaze-Distill: Brain-informed Distillation and Depression-Inspired Geometric Priors for Robust Facial Emotion Recognition
by: Li, Zilin, et al.
Published: (2025)
by: Li, Zilin, et al.
Published: (2025)
Predictive Modeling of Maritime Radar Data Using Transformer Architecture
by: Qesaraku, Bjorna, et al.
Published: (2025)
by: Qesaraku, Bjorna, et al.
Published: (2025)
Similar Items
-
DejaVid: Encoder-Agnostic Learned Temporal Matching for Video Classification
by: Ho, Darryl, et al.
Published: (2025) -
TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition
by: Hassan, Imtiaz Ul, et al.
Published: (2026) -
High-Frequency Semantics and Geometric Priors for End-to-End Detection Transformers in Challenging UAV Imagery
by: Peng, Hongxing, et al.
Published: (2025) -
PhysicsNeRF: Physics-Guided 3D Reconstruction from Sparse Views
by: Barhdadi, Mohamed Rayan, et al.
Published: (2025) -
CG-HOI: Contact-Guided 3D Human-Object Interaction Generation
by: Diller, Christian, et al.
Published: (2023)