:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ji, Xiaozhong, Lin, Chuming, Ding, Zhonggan, Tai, Ying, Zhu, Junwei, Hu, Xiaobin, Luo, Donghao, Ge, Yanhao, Wang, Chengjie
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.18284
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025)

RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis
by: Wang, Wenqing, et al.
Published: (2025)

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024)

SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
by: Tan, Weipeng, et al.
Published: (2024)

A Generalist FaceX via Learning Unified Facial Representation
by: Han, Yue, et al.
Published: (2023)

RealTalk-CN: A Realistic Chinese Speech-Text Dialogue Benchmark With Cross-Modal Interaction Analysis
by: Wang, Enzhi, et al.
Published: (2025)

VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models
by: Xiaobin, Hu, et al.
Published: (2025)

HiFiVFS: High Fidelity Video Face Swapping
by: Chen, Xu, et al.
Published: (2024)

Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars
by: NVIDIA, et al.
Published: (2025)

Talking Face Generation With Lip and Identity Priors
by: Jiajie Wu, et al.
Published: (2025)

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
by: Luo, Donghao, et al.
Published: (2025)

VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots
by: Li, Peizhen, et al.
Published: (2026)

PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation
by: Wang, Baiqin, et al.
Published: (2025)

DiffMagicFace: Identity Consistent Facial Editing of Real Videos
by: Yin, Huanghao, et al.
Published: (2026)

Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation
by: Yaman, Dogucan, et al.
Published: (2025)

MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning
by: Han, Yue, et al.
Published: (2024)

DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
by: Hu, Xiaobin, et al.
Published: (2024)

VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
by: Xu, Sicheng, et al.
Published: (2024)

TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens
by: Zhao, Qingcheng, et al.
Published: (2026)

FaceEditTalker: Controllable Talking Head Generation with Facial Attribute Editing
by: Feng, Guanwen, et al.
Published: (2025)

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation
by: Wang, Qilin, et al.
Published: (2024)

On the attainability of the singular Wiener bound
by: Huang, Zhonggan
Published: (2025)

Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
by: Choi, Jeongsoo, et al.
Published: (2023)

MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control
by: Nazarieh, Fatemeh, et al.
Published: (2025)

Audio-driven Talking Face Generation with Stabilized Synchronization Loss
by: Yaman, Dogucan, et al.
Published: (2023)

GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting
by: Chen, Bo, et al.
Published: (2024)

Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
by: Han, Yue, et al.
Published: (2024)

VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
by: Liang, Yujie, et al.
Published: (2024)

READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
by: Wang, Haotian, et al.
Published: (2025)

FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
by: Xu, Chao, et al.
Published: (2024)

A Revisit to Recast Efficacy in L2 Learning: An Alignment Perspective
by: Haiyan Miao, et al.
Published: (2024)

JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation
by: Chakkera, Sai Tanmay Reddy, et al.
Published: (2024)

OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance
by: Ge, Shuheng, et al.
Published: (2024)

SVFR: A Unified Framework for Generalized Video Face Restoration
by: Wang, Zhiyao, et al.
Published: (2025)

IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
by: Chen, Bo, et al.
Published: (2025)

AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
by: Liu, Tao, et al.
Published: (2024)

FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis
by: Ni, Ziqi, et al.
Published: (2025)

Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models
by: Liu, Yuansen, et al.
Published: (2025)

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
by: Park, Se Jin, et al.
Published: (2024)

Collaborative Face Experts Fusion in Video Generation: Boosting Identity Consistency Across Large Face Poses
by: Wang, Yuji, et al.
Published: (2025)