Saved in:
| Main Authors: | Ji, Xiaozhong, Lin, Chuming, Ding, Zhonggan, Tai, Ying, Zhu, Junwei, Hu, Xiaobin, Luo, Donghao, Ge, Yanhao, Wang, Chengjie |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.18284 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025)
by: Tan, Weipeng, et al.
Published: (2025)
RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis
by: Wang, Wenqing, et al.
Published: (2025)
by: Wang, Wenqing, et al.
Published: (2025)
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024)
by: Ji, Xiaozhong, et al.
Published: (2024)
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
by: Tan, Weipeng, et al.
Published: (2024)
by: Tan, Weipeng, et al.
Published: (2024)
A Generalist FaceX via Learning Unified Facial Representation
by: Han, Yue, et al.
Published: (2023)
by: Han, Yue, et al.
Published: (2023)
RealTalk-CN: A Realistic Chinese Speech-Text Dialogue Benchmark With Cross-Modal Interaction Analysis
by: Wang, Enzhi, et al.
Published: (2025)
by: Wang, Enzhi, et al.
Published: (2025)
VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models
by: Xiaobin, Hu, et al.
Published: (2025)
by: Xiaobin, Hu, et al.
Published: (2025)
HiFiVFS: High Fidelity Video Face Swapping
by: Chen, Xu, et al.
Published: (2024)
by: Chen, Xu, et al.
Published: (2024)
Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars
by: NVIDIA, et al.
Published: (2025)
by: NVIDIA, et al.
Published: (2025)
Talking Face Generation With Lip and Identity Priors
by: Jiajie Wu, et al.
Published: (2025)
by: Jiajie Wu, et al.
Published: (2025)
CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
by: Luo, Donghao, et al.
Published: (2025)
by: Luo, Donghao, et al.
Published: (2025)
VividFace: Real-Time and Realistic Facial Expression Shadowing for Humanoid Robots
by: Li, Peizhen, et al.
Published: (2026)
by: Li, Peizhen, et al.
Published: (2026)
PC-Talk: Precise Facial Animation Control for Audio-Driven Talking Face Generation
by: Wang, Baiqin, et al.
Published: (2025)
by: Wang, Baiqin, et al.
Published: (2025)
DiffMagicFace: Identity Consistent Facial Editing of Real Videos
by: Yin, Huanghao, et al.
Published: (2026)
by: Yin, Huanghao, et al.
Published: (2026)
Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation
by: Yaman, Dogucan, et al.
Published: (2025)
by: Yaman, Dogucan, et al.
Published: (2025)
MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning
by: Han, Yue, et al.
Published: (2024)
by: Han, Yue, et al.
Published: (2024)
DiffuMatting: Synthesizing Arbitrary Objects with Matting-level Annotation
by: Hu, Xiaobin, et al.
Published: (2024)
by: Hu, Xiaobin, et al.
Published: (2024)
VASA-1: Lifelike Audio-Driven Talking Faces Generated in Real Time
by: Xu, Sicheng, et al.
Published: (2024)
by: Xu, Sicheng, et al.
Published: (2024)
TokTalk: Expressive Real-time Facial Animation from Audio-LLM Tokens
by: Zhao, Qingcheng, et al.
Published: (2026)
by: Zhao, Qingcheng, et al.
Published: (2026)
FaceEditTalker: Controllable Talking Head Generation with Facial Attribute Editing
by: Feng, Guanwen, et al.
Published: (2025)
by: Feng, Guanwen, et al.
Published: (2025)
DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation
by: Wang, Qilin, et al.
Published: (2024)
by: Wang, Qilin, et al.
Published: (2024)
On the attainability of the singular Wiener bound
by: Huang, Zhonggan
Published: (2025)
by: Huang, Zhonggan
Published: (2025)
Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
by: Choi, Jeongsoo, et al.
Published: (2023)
by: Choi, Jeongsoo, et al.
Published: (2023)
MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control
by: Nazarieh, Fatemeh, et al.
Published: (2025)
by: Nazarieh, Fatemeh, et al.
Published: (2025)
Audio-driven Talking Face Generation with Stabilized Synchronization Loss
by: Yaman, Dogucan, et al.
Published: (2023)
by: Yaman, Dogucan, et al.
Published: (2023)
GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting
by: Chen, Bo, et al.
Published: (2024)
by: Chen, Bo, et al.
Published: (2024)
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control
by: Han, Yue, et al.
Published: (2024)
by: Han, Yue, et al.
Published: (2024)
VTON-HandFit: Virtual Try-on for Arbitrary Hand Pose Guided by Hand Priors Embedding
by: Liang, Yujie, et al.
Published: (2024)
by: Liang, Yujie, et al.
Published: (2024)
READ: Real-time and Efficient Asynchronous Diffusion for Audio-driven Talking Head Generation
by: Wang, Haotian, et al.
Published: (2025)
by: Wang, Haotian, et al.
Published: (2025)
FaceChain-ImagineID: Freely Crafting High-Fidelity Diverse Talking Faces from Disentangled Audio
by: Xu, Chao, et al.
Published: (2024)
by: Xu, Chao, et al.
Published: (2024)
A Revisit to Recast Efficacy in L2 Learning: An Alignment Perspective
by: Haiyan Miao, et al.
Published: (2024)
by: Haiyan Miao, et al.
Published: (2024)
JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation
by: Chakkera, Sai Tanmay Reddy, et al.
Published: (2024)
by: Chakkera, Sai Tanmay Reddy, et al.
Published: (2024)
OpFlowTalker: Realistic and Natural Talking Face Generation via Optical Flow Guidance
by: Ge, Shuheng, et al.
Published: (2024)
by: Ge, Shuheng, et al.
Published: (2024)
SVFR: A Unified Framework for Generalized Video Face Restoration
by: Wang, Zhiyao, et al.
Published: (2025)
by: Wang, Zhiyao, et al.
Published: (2025)
IMTalker: Efficient Audio-driven Talking Face Generation with Implicit Motion Transfer
by: Chen, Bo, et al.
Published: (2025)
by: Chen, Bo, et al.
Published: (2025)
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
by: Liu, Tao, et al.
Published: (2024)
by: Liu, Tao, et al.
Published: (2024)
FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis
by: Ni, Ziqi, et al.
Published: (2025)
by: Ni, Ziqi, et al.
Published: (2025)
Human-MME: A Holistic Evaluation Benchmark for Human-Centric Multimodal Large Language Models
by: Liu, Yuansen, et al.
Published: (2025)
by: Liu, Yuansen, et al.
Published: (2025)
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation
by: Park, Se Jin, et al.
Published: (2024)
by: Park, Se Jin, et al.
Published: (2024)
Collaborative Face Experts Fusion in Video Generation: Boosting Identity Consistency Across Large Face Poses
by: Wang, Yuji, et al.
Published: (2025)
by: Wang, Yuji, et al.
Published: (2025)
Similar Items
-
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025) -
RealTalk: Realistic Emotion-Aware Lifelike Talking-Head Synthesis
by: Wang, Wenqing, et al.
Published: (2025) -
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024) -
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model
by: Tan, Weipeng, et al.
Published: (2024) -
A Generalist FaceX via Learning Unified Facial Representation
by: Han, Yue, et al.
Published: (2023)