| Main Authors: | Feng, Han; Ma, Wenchao; Gao, Quankai; Zheng, Xianwei; Xue, Nan; Xu, Huijuan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.20786 |
Similar Items
SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations
by: Zhao, Shuting, et al.
Published: (2025)
SentiAvatar: Towards Expressive and Interactive Digital Humans
by: Jin, Chuhao, et al.
Published: (2026)
StreamAvatar: Streaming Diffusion Models for Real-Time Interactive Human Avatars
by: Sun, Zhiyao, et al.
Published: (2025)
GAZEploit: Remote Keystroke Inference Attack by Gaze Estimation from Avatar Views in VR/MR Devices
by: Wang, Hanqiu, et al.
Published: (2024)
The Latency Wall: Benchmarking Off-the-Shelf Emotion Recognition for Real-Time Virtual Avatars
by: Benyamin, Yarin
Published: (2026)
Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs
by: Bao, Yiming, et al.
Published: (2024)
QueryCraft: Transformer-Guided Query Initialization for Enhanced Human-Object Interaction Detection
by: Wang, Yuxiao, et al.
Published: (2025)
Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters
by: Bai, Zechen, et al.
Published: (2024)
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
by: Ki, Taekyung, et al.
Published: (2026)
SASG-DA: Sparse-Aware Semantic-Guided Diffusion Augmentation For Myoelectric Gesture Recognition
by: Liu, Chen, et al.
Published: (2025)
G3R: Generating Rich and Fine-grained mmWave Radar Data from 2D Videos for Generalized Gesture Recognition
by: Deng, Kaikai, et al.
Published: (2024)
RITA: A Real-time Interactive Talking Avatars Framework
by: Cheng, Wuxinlin, et al.
Published: (2024)
POET: Supporting Prompting Creativity and Personalization with Automated Expansion of Text-to-Image Generation
by: Han, Evans Xu, et al.
Published: (2025)
Do MLLMs Understand Pointing? Benchmarking and Enhancing Referential Reasoning in Egocentric Vision
by: Li, Chentao, et al.
Published: (2026)
See Through Their Minds: Learning Transferable Neural Representation from Cross-Subject fMRI
by: Liu, Yulong, et al.
Published: (2024)
MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices
by: Xu, Vasco, et al.
Published: (2025)
When Less Is More: A Sparse Facial Motion Structure For Listening Motion Learning
by: Nguyen, Tri Tung Nguyen, et al.
Published: (2025)
GesGPT: Speech Gesture Synthesis With Text Parsing from ChatGPT
by: Gao, Nan, et al.
Published: (2023)
BATON: A Multimodal Benchmark for Bidirectional Automation Transition Observation in Naturalistic Driving
by: Wang, Yuhang, et al.
Published: (2026)
Referring Human Pose and Mask Estimation in the Wild
by: Miao, Bo, et al.
Published: (2024)
Customizable Avatars with Dynamic Facial Action Coded Expressions (CADyFACE) for Improved User Engagement
by: Witherow, Megan A., et al.
Published: (2024)
Resource-Efficient Gesture Recognition using Low-Resolution Thermal Camera via Spiking Neural Networks and Sparse Segmentation
by: Safa, Ali, et al.
Published: (2024)
VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality
by: Jiang, Ying, et al.
Published: (2024)
UST-Hand: An Uncertainty-aware Spatiotemporal Point Cloud Interaction Network for 3D Self-supervised Hand Pose Estimation
by: Han, Tianhao, et al.
Published: (2026)
EIT-1M: One Million EEG-Image-Text Pairs for Human Visual-textual Recognition and More
by: Zheng, Xu, et al.
Published: (2024)
MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
by: Xu, Zunnan, et al.
Published: (2024)
From Development to Deployment of AI-assisted Telehealth and Screening for Vision- and Hearing-threatening diseases in resource-constrained settings: Field Observations, Challenges and Way Forward
by: Shakya, Mahesh, et al.
Published: (2025)
3DPFIX: Improving Remote Novices' 3D Printing Troubleshooting through Human-AI Collaboration
by: Kwon, Nahyun, et al.
Published: (2024)
VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation
by: Pan, Bo, et al.
Published: (2025)
OW-CLIP: Data-Efficient Visual Supervision for Open-World Object Detection via Human-AI Collaboration
by: Duan, Junwen, et al.
Published: (2025)
LLM4Brain: Training a Large Language Model for Brain Video Understanding
by: Zheng, Ruizhe, et al.
Published: (2024)
Next-Best-Trajectory Planning of Robot Manipulators for Effective Observation and Exploration
by: Renz, Heiko, et al.
Published: (2025)
Steering Generative Models for Accessibility: EasyRead Image Generation
by: Dickenmann, Nicolas, et al.
Published: (2026)
Coupled Confusion Correction: Learning from Crowds with Sparse Annotations
by: Zhang, Hansong, et al.
Published: (2023)
Vid2Coach: Transforming How-To Videos into Task Assistants
by: Huh, Mina, et al.
Published: (2025)
HAGI++: Head-Assisted Gaze Imputation and Generation
by: Jiao, Chuhan, et al.
Published: (2025)
Garment Inertial Denoiser (GID): Endowing Accurate Motion Capture via Loose IMU Denoiser
by: Fang, Jiawei, et al.
Published: (2026)
Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model
by: He, Xu, et al.
Published: (2024)
Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding
by: Guo, Hao, et al.
Published: (2025)
GCCRR: A Short Sequence Gait Cycle Segmentation Method Based on Ear-Worn IMU
by: Xu, Zhenye, et al.
Published: (2024)