:: Library Catalog

Bibliographic Details
Main Authors:	Hashemifard, Kooshan; Climent-Pérez, Pau; Florez-Revuelta, Francisco
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.04509

Similar Items

Action Recognition in Real-World Ambient Assisted Living Environment
by: Zakka, Vincent Gbouna, et al.
Published: (2025)

Enhanced AIoT Multi‐Modal Fusion for Human Activity Recognition in Ambient Assisted Living Environment
by: Patel, Ankit D., et al.
Published: (2024)

Multi-view Video-Pose Pretraining for Operating Room Surgical Activity Recognition
by: Hamoud, Idris, et al.
Published: (2025)

PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living
by: Robinson, Fraser, et al.
Published: (2026)

Multi-Modal Gesture Recognition from Video and Surgical Tool Pose Information via Motion Invariants
by: Atoum, Jumanh, et al.
Published: (2025)

ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection
by: M, Nandakishor, et al.
Published: (2025)

End-to-End Multi-Person Pose Estimation with Pose-Aware Video Transformer
by: Yu, Yonghui, et al.
Published: (2025)

Paving the Way Towards Kinematic Assessment Using Monocular Video: A Preclinical Benchmark of State-of-the-Art Deep-Learning-Based 3D Human Pose Estimators Against Inertial Sensors in Daily Living Activities
by: Medrano-Paredes, Mario, et al.
Published: (2025)

Learning Frequency and Memory-Aware Prompts for Multi-Modal Object Tracking
by: Xu, Boyue, et al.
Published: (2025)

Predicting Penalty Kick Direction Using Multi-Modal Deep Learning with Pose-Guided Attention
by: Ranasinghe, Pasindu, et al.
Published: (2025)

Detection, Recognition and Pose Estimation of Tabletop Objects
by: Nirgude, Sanjuksha, et al.
Published: (2024)

PoseGAM: Robust Unseen Object Pose Estimation via Geometry-Aware Multi-View Reasoning
by: Chen, Jianqi, et al.
Published: (2025)

Demo-Pose: Depth-Monocular Modality Fusion For Object Pose Estimation
by: Agarwal, Rachit, et al.
Published: (2026)

Deep Learning Pose Estimation for Multi-Label Recognition of Combined Hyperkinetic Movement Disorders
by: Cif, Laura, et al.
Published: (2026)

TSM-Pose: Topology-Aware Learning with Semantic Mamba for Category-Level Object Pose Estimation
by: Liu, Jinshuo, et al.
Published: (2026)

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation
by: Huang, Jiehui, et al.
Published: (2025)

Post-Hurricane Debris Segmentation Using Fine-Tuned Foundational Vision Models
by: Amini, Kooshan, et al.
Published: (2025)

Deep Adversarial Learning with Activity-Based User Discrimination Task for Human Activity Recognition
by: Calatrava-Nicolás, Francisco M., et al.
Published: (2024)

Continual Multimodal Egocentric Activity Recognition via Modality-Aware Novel Detection
by: Lim, Wonseon, et al.
Published: (2026)

PoseTraj: Pose-Aware Trajectory Control in Video Diffusion
by: Ji, Longbin, et al.
Published: (2025)

Towards LLM-Powered Ambient Sensor Based Multi-Person Human Activity Recognition
by: Chen, Xi, et al.
Published: (2024)

Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking
by: Wang, Shilei, et al.
Published: (2026)

Reliable Multi-Modal Object Re-Identification via Modality-Aware Graph Reasoning
by: Wan, Xixi, et al.
Published: (2025)

Can Text-to-image Model Assist Multi-modal Learning for Visual Recognition with Visual Modality Missing?
by: Feng, Tiantian, et al.
Published: (2024)

IDSelect: A RL-Based Cost-Aware Selection Agent for Video-based Multi-Modal Person Recognition
by: Ji, Yuyang, et al.
Published: (2026)

X Modality Assisting RGBT Object Tracking
by: Ding, Zhaisheng, et al.
Published: (2023)

TransPose: 6D Object Pose Estimation with Geometry-Aware Transformer
by: Lin, Xiao, et al.
Published: (2023)

Deep Learning Approaches for Human Action Recognition in Video Data
by: Xie, Yufei
Published: (2024)

A Distributed Multi-Modal Sensing Approach for Human Activity Recognition in Real-Time Human-Robot Collaboration
by: Belcamino, Valerio, et al.
Published: (2026)

Deep Learning-Based Object Pose Estimation: A Comprehensive Survey
by: Liu, Jian, et al.
Published: (2024)

Group Activity Recognition using Unreliable Tracked Pose
by: Thilakarathne, Haritha, et al.
Published: (2024)

Language-Assisted Deep Learning for Autistic Behaviors Recognition
by: Deng, Andong, et al.
Published: (2022)

Multi-Modal Monocular Endoscopic Depth and Pose Estimation with Edge-Guided Self-Supervision
by: Ju, Xinwei, et al.
Published: (2026)

BEVPose: Unveiling Scene Semantics through Pose-Guided Multi-Modal BEV Alignment
by: Hosseinzadeh, Mehdi, et al.
Published: (2024)

Modality-Agnostic Prompt Learning for Multi-Modal Camouflaged Object Detection
by: Wang, Hao, et al.
Published: (2026)

Object Pose Estimation through Dexterous Touch
by: Shahidzadeh, Amir-Hossein, et al.
Published: (2025)

UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References
by: Li, Ming-Feng, et al.
Published: (2025)

MAPRPose: Mask-Aware Proposal and Amodal Refinement for Multi-Object 6D Pose Estimation
by: Luo, Yang, et al.
Published: (2026)

Pixels or Positions? Benchmarking Modalities in Group Activity Recognition
by: Karki, Drishya, et al.
Published: (2025)

Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos
by: Sommer, Leonhard, et al.
Published: (2024)