| Main Authors: | Huang, Yihuan; Liu, Jiajun; Ren, Yanzhen; Xue, Jun; Liu, Wuyang; Sun, Zongkun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.05803 |
Similar Items
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
by: Sun, Zhiyao, et al.
Published: (2023)
Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis
by: Daněček, Radek, et al.
Published: (2025)
OT-Talk: Animating 3D Talking Head with Optimal Transportation
by: Wang, Xinmu, et al.
Published: (2025)
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics
by: Chae-Yeon, Lee, et al.
Published: (2025)
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset
by: Sung-Bin, Kim, et al.
Published: (2024)
PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis
by: Xie, Yifan, et al.
Published: (2024)
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
by: Wang, Haotian, et al.
Published: (2025)
StyGazeTalk: Learning Stylized Generation of Gaze and Head Dynamics
by: Shi, Chengwei, et al.
Published: (2025)
MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
by: Li, Xinyang, et al.
Published: (2025)
NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis
by: Liu, Xiaoxing, et al.
Published: (2025)
Lightning Fast Caching-based Parallel Denoising Prediction for Accelerating Talking Head Generation
by: Long, Jianzhi, et al.
Published: (2025)
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert
by: EunGi, Han, et al.
Published: (2024)
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
by: Airale, Louis, et al.
Published: (2023)
Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation
by: Mao, Yuxiang, et al.
Published: (2025)
SyncLight: Single-Edit Multi-View Relighting
by: Serrano-Lozano, David, et al.
Published: (2026)
VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection
by: Li, Wuyang, et al.
Published: (2025)
Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis
by: Shen, Shuai, et al.
Published: (2025)
One Shot, One Talk: Whole-body Talking Avatar from a Single Image
by: Xiang, Jun, et al.
Published: (2024)
SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
by: Liu, Yuan, et al.
Published: (2023)
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
by: Peng, Ziqiao, et al.
Published: (2023)
Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
by: Zhang, Zeyi, et al.
Published: (2024)
ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
by: Guan, Jiazhi, et al.
Published: (2024)
Learn2Talk: 3D Talking Face Learns from 2D Talking Face
by: Zhuang, Yixiang, et al.
Published: (2024)
SyncSDE: A Probabilistic Framework for Diffusion Synchronization
by: Lee, Hyunjun, et al.
Published: (2025)
In-Context Sync-LoRA for Portrait Video Editing
by: Polaczek, Sagi, et al.
Published: (2025)
Tolerance-Aware Deep Optics
by: Dai, Jun, et al.
Published: (2025)
EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers
by: Flynn, John, et al.
Published: (2026)
Profiling the Voice: Speaker-Specific Phoneme Fingerprinting for Speech Deepfake Detection
by: Xue, Jun, et al.
Published: (2026)
PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads
by: Verma, Shashikant, et al.
Published: (2025)
GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling
by: Liu, Pinxin, et al.
Published: (2025)
Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions
by: Prinzler, Malte, et al.
Published: (2024)
3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis
by: Liu, Ruiqi, et al.
Published: (2024)
TextToon: Real-Time Text Toonify Head Avatar from Single Video
by: Song, Luchuan, et al.
Published: (2024)
Geometry-Aware Texture Generation for 3D Head Modeling with Artist-driven Control
by: Fadaeinejad, Amin, et al.
Published: (2025)
FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding
by: Xiang, Jun, et al.
Published: (2023)
LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation
by: Lu, Xin, et al.
Published: (2025)
HHAvatar: Gaussian Head Avatar with Dynamic Hairs
by: Liao, Zhanfeng, et al.
Published: (2023)
SEGA: Drivable 3D Gaussian Head Avatar from a Single Image
by: Guo, Chen, et al.
Published: (2025)
FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
by: Aneja, Shivangi, et al.
Published: (2023)
VRMM: A Volumetric Relightable Morphable Head Model
by: Yang, Haotian, et al.
Published: (2024)