:: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Feng, Han, Ma, Wenchao, Gao, Quankai, Zheng, Xianwei, Xue, Nan, Xu, Huijuan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Human-Computer Interaction
Online Access:	https://arxiv.org/abs/2405.20786
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations
by: Zhao, Shuting, et al.
Published: (2025)

SentiAvatar: Towards Expressive and Interactive Digital Humans
by: Jin, Chuhao, et al.
Published: (2026)

StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
by: Sun, Zhiyao, et al.
Published: (2025)

GAZEploit: Remote Keystroke Inference Attack by Gaze Estimation from Avatar Views in VR/MR Devices
by: Wang, Hanqiu, et al.
Published: (2024)

The Latency Wall: Benchmarking Off-the-Shelf Emotion Recognition for Real-Time Virtual Avatars
by: Benyamin, Yarin
Published: (2026)

Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs
by: Bao, Yiming, et al.
Published: (2024)

QueryCraft: Transformer-Guided Query Initialization for Enhanced Human-Object Interaction Detection
by: Wang, Yuxiao, et al.
Published: (2025)

Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters
by: Bai, Zechen, et al.
Published: (2024)

Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
by: Ki, Taekyung, et al.
Published: (2026)

SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition
by: Liu, Chen, et al.
Published: (2025)

G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition
by: Deng, Kaikai, et al.
Published: (2024)

RITA: A Real-time Interactive Talking Avatars Framework
by: Cheng, Wuxinlin, et al.
Published: (2024)

POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation
by: Han, Evans Xu, et al.
Published: (2025)

Do MLLMs Understand Pointing? Benchmarking and Enhancing Referential Reasoning in Egocentric Vision
by: Li, Chentao, et al.
Published: (2026)

See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI
by: Liu, Yulong, et al.
Published: (2024)

MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices
by: Xu, Vasco, et al.
Published: (2025)

When Less Is More: A Sparse Facial Motion Structure For Listening Motion Learning
by: Nguyen, Tri Tung Nguyen, et al.
Published: (2025)

GesGPT: Speech Gesture Synthesis With Text Parsing from ChatGPT
by: Gao, Nan, et al.
Published: (2023)

BATON: A Multimodal Benchmark for Bidirectional Automation Transition Observation in Naturalistic Driving
by: Wang, Yuhang, et al.
Published: (2026)

Referring Human Pose and Mask Estimation in the Wild
by: Miao, Bo, et al.
Published: (2024)

Customizable Avatars with Dynamic Facial Action Coded Expressions (CADyFACE) for Improved User Engagement
by: Witherow, Megan A., et al.
Published: (2024)

Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation
by: Safa, Ali, et al.
Published: (2024)

VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality
by: Jiang, Ying, et al.
Published: (2024)

UST-Hand: An Uncertainty-aware Spatiotemporal Point Cloud Interaction Network for 3D Self-supervised Hand Pose Estimation
by: Han, Tianhao, et al.
Published: (2026)

EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More
by: Zheng, Xu, et al.
Published: (2024)

MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
by: Xu, Zunnan, et al.
Published: (2024)

From Development to Deployment of AI-assisted Telehealth and Screening for Vision- and Hearing-threatening diseases in resource-constrained settings: Field Observations, Challenges and Way Forward
by: Shakya, Mahesh, et al.
Published: (2025)

3DPFIX: Improving Remote Novices' 3D Printing Troubleshooting through Human-AI Collaboration
by: Kwon, Nahyun, et al.
Published: (2024)

VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation
by: Pan, Bo, et al.
Published: (2025)

OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration
by: Duan, Junwen, et al.
Published: (2025)

LLM4Brain: Training a Large Language Model for Brain Video Understanding
by: Zheng, Ruizhe, et al.
Published: (2024)

Next-Best-Trajectory Planning of Robot Manipulators for Effective Observation and Exploration
by: Renz, Heiko, et al.
Published: (2025)

Steering Generative Models for Accessibility: EasyRead Image Generation
by: Dickenmann, Nicolas, et al.
Published: (2026)

Coupled Confusion Correction: Learning from Crowds with Sparse Annotations
by: Zhang, Hansong, et al.
Published: (2023)

Vid2Coach: Transforming How-To Videos into Task Assistants
by: Huh, Mina, et al.
Published: (2025)

HAGI++: Head-Assisted Gaze Imputation and Generation
by: Jiao, Chuhan, et al.
Published: (2025)

Garment Inertial Denoiser (GID): Endowing Accurate Motion Capture via Loose IMU Denoiser
by: Fang, Jiawei, et al.
Published: (2026)

Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
by: He, Xu, et al.
Published: (2024)

Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
by: Guo, Hao, et al.
Published: (2025)

GCCRR: A Short Sequence Gait Cycle Segmentation Method Based on Ear-Worn IMU
by: Xu, Zhenye, et al.
Published: (2024)