:: Library Catalog

Bibliographic Details
Main Authors:	Huang, Yihuan; Liu, Jiajun; Ren, Yanzhen; Xue, Jun; Liu, Wuyang; Sun, Zongkun
Format:	Preprint
Published:	2025
Subjects:	Graphics; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2504.05803

Similar Items

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
by: Sun, Zhiyao, et al.
Published: (2023)

Supervising 3D Talking Head Avatars with Analysis-by-Audio-Synthesis
by: Daněček, Radek, et al.
Published: (2025)

OT-Talk: Animating 3D Talking Head with Optimal Transportation
by: Wang, Xinmu, et al.
Published: (2025)

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics
by: Chae-Yeon, Lee, et al.
Published: (2025)

MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset
by: Sung-Bin, Kim, et al.
Published: (2024)

PointTalk: Audio-Driven Dynamic Lip Point Cloud for 3D Gaussian-based Talking Head Synthesis
by: Xie, Yifan, et al.
Published: (2024)

READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
by: Wang, Haotian, et al.
Published: (2025)

StyGazeTalk: Learning Stylized Generation of Gaze and Head Dynamics
by: Shi, Chengwei, et al.
Published: (2025)

MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
by: Li, Xinyang, et al.
Published: (2025)

NeRF-3DTalker: Neural Radiance Field with 3D Prior Aided Audio Disentanglement for Talking Head Synthesis
by: Liu, Xiaoxing, et al.
Published: (2025)

Lightning Fast Caching-based Parallel Denoising Prediction for Accelerating Talking Head Generation
by: Long, Jianzhi, et al.
Published: (2025)

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert
by: EunGi, Han, et al.
Published: (2024)

A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation
by: Airale, Louis, et al.
Published: (2023)

Learning Disentangled Speech- and Expression-Driven Blendshapes for 3D Talking Face Animation
by: Mao, Yuxiang, et al.
Published: (2025)

SyncLight: Single-Edit Multi-View Relighting
by: Serrano-Lozano, David, et al.
Published: (2026)

VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection
by: Li, Wuyang, et al.
Published: (2025)

Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis
by: Shen, Shuai, et al.
Published: (2025)

One Shot, One Talk: Whole-body Talking Avatar from a Single Image
by: Xiang, Jun, et al.
Published: (2024)

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
by: Liu, Yuan, et al.
Published: (2023)

SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
by: Peng, Ziqiao, et al.
Published: (2023)

Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Synthesis
by: Zhang, Zeyi, et al.
Published: (2024)

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
by: Guan, Jiazhi, et al.
Published: (2024)

Learn2Talk: 3D Talking Face Learns from 2D Talking Face
by: Zhuang, Yixiang, et al.
Published: (2024)

SyncSDE: A Probabilistic Framework for Diffusion Synchronization
by: Lee, Hyunjun, et al.
Published: (2025)

In-Context Sync-LoRA for Portrait Video Editing
by: Polaczek, Sagi, et al.
Published: (2025)

Tolerance-Aware Deep Optics
by: Dai, Jun, et al.
Published: (2025)

EditYourself: Audio-Driven Generation and Manipulation of Talking Head Videos with Diffusion Transformers
by: Flynn, John, et al.
Published: (2026)

Profiling the Voice: Speaker-Specific Phoneme Fingerprinting for Speech Deepfake Detection
by: Xue, Jun, et al.
Published: (2026)

PanoHair: Detailed Hair Strand Synthesis on Volumetric Heads
by: Verma, Shashikant, et al.
Published: (2025)

GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling
by: Liu, Pinxin, et al.
Published: (2025)

Joker: Conditional 3D Head Synthesis with Extreme Facial Expressions
by: Prinzler, Malte, et al.
Published: (2024)

3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis
by: Liu, Ruiqi, et al.
Published: (2024)

TextToon: Real-Time Text Toonify Head Avatar from Single Video
by: Song, Luchuan, et al.
Published: (2024)

Geometry-Aware Texture Generation for 3D Head Modeling with Artist-driven Control
by: Fadaeinejad, Amin, et al.
Published: (2025)

FlashAvatar: High-fidelity Head Avatar with Efficient Gaussian Embedding
by: Xiang, Jun, et al.
Published: (2023)

LSF-Animation: Label-Free Speech-Driven Facial Animation via Implicit Feature Representation
by: Lu, Xin, et al.
Published: (2025)

HHAvatar: Gaussian Head Avatar with Dynamic Hairs
by: Liao, Zhanfeng, et al.
Published: (2023)

SEGA: Drivable 3D Gaussian Head Avatar from a Single Image
by: Guo, Chen, et al.
Published: (2025)

FaceTalk: Audio-Driven Motion Diffusion for Neural Parametric Head Models
by: Aneja, Shivangi, et al.
Published: (2023)

VRMM: A Volumetric Relightable Morphable Head Model
by: Yang, Haotian, et al.
Published: (2024)