| Main Authors: | Hashemifard, Kooshan, Climent-Pérez, Pau, Florez-Revuelta, Francisco |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04509 |
Similar Items
Action Recognition in Real-World Ambient Assisted Living Environment
by: Zakka, Vincent Gbouna, et al.
Published: (2025)
Enhanced AIoT Multi‐Modal Fusion for Human Activity Recognition in Ambient Assisted Living Environment
by: Ankit D. Patel, et al.
Published: (2024)
Multi-view Video-Pose Pretraining for Operating Room Surgical Activity Recognition
by: Hamoud, Idris, et al.
Published: (2025)
PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living
by: Robinson, Fraser, et al.
Published: (2026)
Multi-Modal Gesture Recognition from Video and Surgical Tool Pose Information via Motion Invariants
by: Atoum, Jumanh, et al.
Published: (2025)
ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
by: M, Nandakishor, et al.
Published: (2025)
End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer
by: Yu, Yonghui, et al.
Published: (2025)
Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities
by: Medrano-Paredes, Mario, et al.
Published: (2025)
Learning Frequency and Memory-Aware Prompts for Multi-Modal Object Tracking
by: Xu, Boyue, et al.
Published: (2025)
Predicting Penalty Kick Direction Using Multi-Modal Deep Learning with Pose-Guided Attention
by: Ranasinghe, Pasindu, et al.
Published: (2025)
Detection, Recognition and Pose Estimation of Tabletop Objects
by: Nirgude, Sanjuksha, et al.
Published: (2024)
PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
by: Chen, Jianqi, et al.
Published: (2025)
Demo-Pose: Depth-Monocular Modality Fusion For Object Pose Estimation
by: Agarwal, Rachit, et al.
Published: (2026)
Deep Learning Pose Estimation for Multi-Label Recognition of Combined Hyperkinetic Movement Disorders
by: Cif, Laura, et al.
Published: (2026)
TSM-Pose: Topology-Aware Learning with Semantic Mamba for Category-Level Object Pose Estimation
by: Liu, Jinshuo, et al.
Published: (2026)
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
by: Huang, Jiehui, et al.
Published: (2025)
Post-Hurricane Debris Segmentation Using Fine-Tuned Foundational Vision Models
by: Amini, Kooshan, et al.
Published: (2025)
Deep Adversarial Learning with Activity-Based User Discrimination Task for Human Activity Recognition
by: Calatrava-Nicolás, Francisco M., et al.
Published: (2024)
Continual Multimodal Egocentric Activity Recognition via Modality-Aware Novel Detection
by: Lim, Wonseon, et al.
Published: (2026)
PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
by: Ji, Longbin, et al.
Published: (2025)
Towards LLM-Powered Ambient Sensor Based Multi-Person Human Activity Recognition
by: Chen, Xi, et al.
Published: (2024)
Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking
by: Wang, Shilei, et al.
Published: (2026)
Reliable Multi-Modal Object Re-Identification via Modality-Aware Graph Reasoning
by: Wan, Xixi, et al.
Published: (2025)
Can Text-to-image Model Assist Multi-modal Learning for Visual Recognition with Visual Modality Missing?
by: Feng, Tiantian, et al.
Published: (2024)
IDSelect: A RL-Based Cost-Aware Selection Agent for Video-based Multi-Modal Person Recognition
by: Ji, Yuyang, et al.
Published: (2026)
X Modality Assisting RGBT Object Tracking
by: Ding, Zhaisheng, et al.
Published: (2023)
TransPose: 6D Object Pose Estimation with Geometry-Aware Transformer
by: Lin, Xiao, et al.
Published: (2023)
Deep Learning Approaches for Human Action Recognition in Video Data
by: Xie, Yufei
Published: (2024)
A Distributed Multi-Modal Sensing Approach for Human Activity Recognition in Real-Time Human-Robot Collaboration
by: Belcamino, Valerio, et al.
Published: (2026)
Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
by: Liu, Jian, et al.
Published: (2024)
Group Activity Recognition using Unreliable Tracked Pose
by: Thilakarathne, Haritha, et al.
Published: (2024)
Language-Assisted Deep Learning for Autistic Behaviors Recognition
by: Deng, Andong, et al.
Published: (2022)
Multi-Modal Monocular Endoscopic Depth and Pose Estimation with Edge-Guided Self-Supervision
by: Ju, Xinwei, et al.
Published: (2026)
BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
by: Hosseinzadeh, Mehdi, et al.
Published: (2024)
Modality-Agnostic Prompt Learning for Multi-Modal Camouflaged Object Detection
by: Wang, Hao, et al.
Published: (2026)
Object Pose Estimation through Dexterous Touch
by: Shahidzadeh, Amir-Hossein, et al.
Published: (2025)
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References
by: Li, Ming-Feng, et al.
Published: (2025)
MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation
by: Luo, Yang, et al.
Published: (2026)
Pixels or Positions? Benchmarking Modalities in Group Activity Recognition
by: Karki, Drishya, et al.
Published: (2025)
Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos
by: Sommer, Leonhard, et al.
Published: (2024)