Saved in:
| Main Authors: | Di, Xinhan, Qi, Kristin, Yu, Pengqian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.20987 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Preview WB-DH: Towards Whole Body Digital Human Bench for the Generation of Whole-body Talking Avatar Videos
by: Wang, Chaoyi, et al.
Published: (2025)
by: Wang, Chaoyi, et al.
Published: (2025)
Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation
by: Qi, Kristin, et al.
Published: (2025)
by: Qi, Kristin, et al.
Published: (2025)
LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters
by: Zhang, Haomin, et al.
Published: (2025)
by: Zhang, Haomin, et al.
Published: (2025)
Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation
by: Zhao, Shuting, et al.
Published: (2024)
by: Zhao, Shuting, et al.
Published: (2024)
Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars
by: Zhang, Youliang, et al.
Published: (2026)
by: Zhang, Youliang, et al.
Published: (2026)
OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation
by: Gan, Qijun, et al.
Published: (2025)
by: Gan, Qijun, et al.
Published: (2025)
Audio-Driven Talking Face Video Generation with Joint Uncertainty Learning
by: Xie, Yifan, et al.
Published: (2025)
by: Xie, Yifan, et al.
Published: (2025)
Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute
by: Gong, Chaoqun, et al.
Published: (2024)
by: Gong, Chaoqun, et al.
Published: (2024)
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
by: Cheng, Hanbo, et al.
Published: (2024)
by: Cheng, Hanbo, et al.
Published: (2024)
Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture
by: Li, Xuanchen, et al.
Published: (2025)
by: Li, Xuanchen, et al.
Published: (2025)
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
by: Wang, Yuchi, et al.
Published: (2024)
by: Wang, Yuchi, et al.
Published: (2024)
SoulX-FlashTalk: Real-Time Infinite Streaming of Audio-Driven Avatars via Self-Correcting Bidirectional Distillation
by: Shen, Le, et al.
Published: (2025)
by: Shen, Le, et al.
Published: (2025)
RITA: A Real-time Interactive Talking Avatars Framework
by: Cheng, Wuxinlin, et al.
Published: (2024)
by: Cheng, Wuxinlin, et al.
Published: (2024)
RAW: Robust Avatar Watermarking -- Benchmarking and Baseline
by: Parry, Jack, et al.
Published: (2026)
by: Parry, Jack, et al.
Published: (2026)
WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control
by: Jiang, Haoran, et al.
Published: (2025)
by: Jiang, Haoran, et al.
Published: (2025)
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
by: Chen, Hejia, et al.
Published: (2025)
by: Chen, Hejia, et al.
Published: (2025)
HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models
by: Xie, Zhifeng, et al.
Published: (2024)
by: Xie, Zhifeng, et al.
Published: (2024)
Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars
by: Austin, Derek
Published: (2026)
by: Austin, Derek
Published: (2026)
Text-Driven Emotionally Continuous Talking Face Generation
by: Yang, Hao, et al.
Published: (2026)
by: Yang, Hao, et al.
Published: (2026)
Think-Before-Draw: Decomposing Emotion Semantics & Fine-Grained Controllable Expressive Talking Head Generation
by: Shi, Hanlei, et al.
Published: (2025)
by: Shi, Hanlei, et al.
Published: (2025)
Talk in Pieces, See in Whole: Disentangling and Hierarchical Aggregating Representations for Language-based Object Detection
by: An, Sojung, et al.
Published: (2025)
by: An, Sojung, et al.
Published: (2025)
MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
by: Gu, Shengbo, et al.
Published: (2025)
by: Gu, Shengbo, et al.
Published: (2025)
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
by: Chen, Bo, et al.
Published: (2025)
by: Chen, Bo, et al.
Published: (2025)
PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation
by: Wang, Baiqin, et al.
Published: (2025)
by: Wang, Baiqin, et al.
Published: (2025)
Automated Lesion Segmentation in Whole-Body PET/CT in a multitracer setting
by: Xue, Qiaoyi, et al.
Published: (2024)
by: Xue, Qiaoyi, et al.
Published: (2024)
DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
by: Wei, Jiabao, et al.
Published: (2024)
by: Wei, Jiabao, et al.
Published: (2024)
Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy
by: Yu, Hao, et al.
Published: (2025)
by: Yu, Hao, et al.
Published: (2025)
DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model
by: Liu, Kangwei, et al.
Published: (2025)
by: Liu, Kangwei, et al.
Published: (2025)
SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space
by: Zhang, Zeren, et al.
Published: (2024)
by: Zhang, Zeren, et al.
Published: (2024)
Is It Really You? Exploring Biometric Verification Scenarios in Photorealistic Talking-Head Avatar Videos
by: Pedrouzo-Rodriguez, Laura, et al.
Published: (2025)
by: Pedrouzo-Rodriguez, Laura, et al.
Published: (2025)
EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
by: Wang, Haotian, et al.
Published: (2024)
by: Wang, Haotian, et al.
Published: (2024)
MAGE:A Multi-stage Avatar Generator with Sparse Observations
by: Du, Fangyu, et al.
Published: (2025)
by: Du, Fangyu, et al.
Published: (2025)
The autoPET3 Challenge: Automated Lesion Segmentation in Whole-Body PET/CT $\unicode{x2013}$ Multitracer Multicenter Generalization
by: Dexl, Jakob, et al.
Published: (2026)
by: Dexl, Jakob, et al.
Published: (2026)
F3G-Avatar : Face Focused Full-body Gaussian Avatar
by: Menu, Willem, et al.
Published: (2026)
by: Menu, Willem, et al.
Published: (2026)
AMG: Avatar Motion Guided Video Generation
by: Yang, Zhangsihao, et al.
Published: (2024)
by: Yang, Zhangsihao, et al.
Published: (2024)
EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors
by: Zhang, Shikun, et al.
Published: (2025)
by: Zhang, Shikun, et al.
Published: (2025)
Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations
by: Huang, Wei-Jin, et al.
Published: (2026)
by: Huang, Wei-Jin, et al.
Published: (2026)
GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces
by: Yu, Hao, et al.
Published: (2025)
by: Yu, Hao, et al.
Published: (2025)
LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging
by: Rokuss, Maximilian, et al.
Published: (2025)
by: Rokuss, Maximilian, et al.
Published: (2025)
3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information
by: Wen, Sihan, et al.
Published: (2024)
by: Wen, Sihan, et al.
Published: (2024)
Similar Items
-
Preview WB-DH: Towards Whole Body Digital Human Bench for the Generation of Whole-body Talking Avatar Videos
by: Wang, Chaoyi, et al.
Published: (2025) -
Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation
by: Qi, Kristin, et al.
Published: (2025) -
LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters
by: Zhang, Haomin, et al.
Published: (2025) -
Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation
by: Zhao, Shuting, et al.
Published: (2024) -
Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars
by: Zhang, Youliang, et al.
Published: (2026)