:: Library Catalog

Bibliographic Details
Main Authors:	Di, Xinhan, Qi, Kristin, Yu, Pengqian
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.20987

Similar Items

WB-DH: Towards Whole Body Digital Human Bench for the Generation of Whole-body Talking Avatar Videos
by: Wang, Chaoyi, et al.
Published: (2025)

Attentional Triple-Encoder Network in Spatiospectral Domains for Medical Image Segmentation
by: Qi, Kristin, et al.
Published: (2025)

LD-LAudio-V1: Video-to-Long-Form-Audio Generation Extension with Dual Lightweight Adapters
by: Zhang, Haomin, et al.
Published: (2025)

Towards Full-parameter and Parameter-efficient Self-learning For Endoscopic Camera Depth Estimation
by: Zhao, Shuting, et al.
Published: (2024)

Making Avatars Interact: Towards Text-Driven Human-Object Interaction for Controllable Talking Avatars
by: Zhang, Youliang, et al.
Published: (2026)

OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation
by: Gan, Qijun, et al.
Published: (2025)

Audio-Driven Talking Face Video Generation with Joint Uncertainty Learning
by: Xie, Yifan, et al.
Published: (2025)

Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute
by: Gong, Chaoqun, et al.
Published: (2024)

DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation
by: Cheng, Hanbo, et al.
Published: (2024)

Towards High-fidelity 3D Talking Avatar with Personalized Dynamic Texture
by: Li, Xuanchen, et al.
Published: (2025)

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
by: Wang, Yuchi, et al.
Published: (2024)

SoulX-FlashTalk: Real-Time Infinite Streaming of Audio-Driven Avatars via Self-Correcting Bidirectional Distillation
by: Shen, Le, et al.
Published: (2025)

RITA: A Real-time Interactive Talking Avatars Framework
by: Cheng, Wuxinlin, et al.
Published: (2024)

RAW: Robust Avatar Watermarking -- Benchmarking and Baseline
by: Parry, Jack, et al.
Published: (2026)

WholeBodyVLA: Towards Unified Latent VLA for Whole-Body Loco-Manipulation Control
by: Jiang, Haoran, et al.
Published: (2025)

Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
by: Chen, Hejia, et al.
Published: (2025)

HieraFashDiff: Hierarchical Fashion Design with Multi-stage Diffusion Models
by: Xie, Zhifeng, et al.
Published: (2024)

Better Rigs, Not Bigger Networks: A Body Model Ablation for Gaussian Avatars
by: Austin, Derek
Published: (2026)

Text-Driven Emotionally Continuous Talking Face Generation
by: Yang, Hao, et al.
Published: (2026)

Think-Before-Draw: Decomposing Emotion Semantics & Fine-Grained Controllable Expressive Talking Head Generation
by: Shi, Hanlei, et al.
Published: (2025)

Talk in Pieces, See in Whole: Disentangling and Hierarchical Aggregating Representations for Language-based Object Detection
by: An, Sojung, et al.
Published: (2025)

MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
by: Gu, Shengbo, et al.
Published: (2025)

IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
by: Chen, Bo, et al.
Published: (2025)

PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation
by: Wang, Baiqin, et al.
Published: (2025)

Automated Lesion Segmentation in Whole-Body PET/CT in a multitracer setting
by: Xue, Qiaoyi, et al.
Published: (2024)

DH-VTON: Deep Text-Driven Virtual Try-On via Hybrid Attention Learning
by: Wei, Jiabao, et al.
Published: (2024)

Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy
by: Yu, Hao, et al.
Published: (2025)

DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model
by: Liu, Kangwei, et al.
Published: (2025)

SwapTalk: Audio-Driven Talking Face Generation with One-Shot Customization in Latent Space
by: Zhang, Zeren, et al.
Published: (2024)

Is It Really You? Exploring Biometric Verification Scenarios in Photorealistic Talking-Head Avatar Videos
by: Pedrouzo-Rodriguez, Laura, et al.
Published: (2025)

EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion
by: Wang, Haotian, et al.
Published: (2024)

MAGE: A Multi-stage Avatar Generator with Sparse Observations
by: Du, Fangyu, et al.
Published: (2025)

The autoPET3 Challenge: Automated Lesion Segmentation in Whole-Body PET/CT – Multitracer Multicenter Generalization
by: Dexl, Jakob, et al.
Published: (2026)

F3G-Avatar: Face Focused Full-body Gaussian Avatar
by: Menu, Willem, et al.
Published: (2026)

AMG: Avatar Motion Guided Video Generation
by: Yang, Zhangsihao, et al.
Published: (2024)

EAvatar: Expression-Aware Head Avatar Reconstruction with Generative Geometry Priors
by: Zhang, Shikun, et al.
Published: (2025)

Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations
by: Huang, Wei-Jin, et al.
Published: (2026)

GenEAva: Generating Cartoon Avatars with Fine-Grained Facial Expressions from Realistic Diffusion-based Faces
by: Yu, Hao, et al.
Published: (2025)

LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging
by: Rokuss, Maximilian, et al.
Published: (2025)

3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information
by: Wen, Sihan, et al.
Published: (2024)