Saved in:
| Main Authors: | Chen, Peiyin, Yang, Zhuowei, Feng, Hui, Jiang, Sheng, Yan, Rui |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.10650 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
by: Ki, Taekyung, et al.
Published: (2024)
by: Ki, Taekyung, et al.
Published: (2024)
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
by: Wang, Mengchao, et al.
Published: (2025)
by: Wang, Mengchao, et al.
Published: (2025)
DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
by: Shi, Yuxiang, et al.
Published: (2025)
by: Shi, Yuxiang, et al.
Published: (2025)
DEMO: A Statistical Perspective for Efficient Image-Text Matching
by: Zhang, Fan, et al.
Published: (2024)
by: Zhang, Fan, et al.
Published: (2024)
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
by: Ye, Zhenhui, et al.
Published: (2024)
by: Ye, Zhenhui, et al.
Published: (2024)
CogPortrait: Fine-Grained Eye-Region Control in Portrait Animation via Hierarchical Agent Planning
by: Feng, He, et al.
Published: (2026)
by: Feng, He, et al.
Published: (2026)
FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint
by: Tang, Jiapeng, et al.
Published: (2025)
by: Tang, Jiapeng, et al.
Published: (2025)
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025)
by: Tan, Weipeng, et al.
Published: (2025)
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)
by: Tan, Shuai, et al.
Published: (2025)
LVLMs as inspectors: an agentic framework for category-level structural defect annotation
by: Jiang, Sheng, et al.
Published: (2025)
by: Jiang, Sheng, et al.
Published: (2025)
PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment
by: Ji, Chaonan, et al.
Published: (2026)
by: Ji, Chaonan, et al.
Published: (2026)
Toward Fine-Grained Facial Control in 3D Talking Head Generation
by: Xie, Shaoyang, et al.
Published: (2026)
by: Xie, Shaoyang, et al.
Published: (2026)
CDST: Color Disentangled Style Transfer for Universal Style Reference Customization
by: Zhang, Shiwen, et al.
Published: (2025)
by: Zhang, Shiwen, et al.
Published: (2025)
LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
by: Li, Tianqi, et al.
Published: (2024)
by: Li, Tianqi, et al.
Published: (2024)
HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis
by: Liu, Shiyu, et al.
Published: (2025)
by: Liu, Shiyu, et al.
Published: (2025)
Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior
by: Ko, Jaehoon, et al.
Published: (2024)
by: Ko, Jaehoon, et al.
Published: (2024)
FineXtrol: Controllable Motion Generation via Fine-Grained Text
by: Shen, Keming, et al.
Published: (2025)
by: Shen, Keming, et al.
Published: (2025)
Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement
by: Yu, Runyi, et al.
Published: (2024)
by: Yu, Runyi, et al.
Published: (2024)
I2VControl: Disentangled and Unified Video Motion Synthesis Control
by: Feng, Wanquan, et al.
Published: (2024)
by: Feng, Wanquan, et al.
Published: (2024)
IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait
by: Yang, Han, et al.
Published: (2025)
by: Yang, Han, et al.
Published: (2025)
MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs
by: Du, Yipeng, et al.
Published: (2025)
by: Du, Yipeng, et al.
Published: (2025)
Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
by: Li, Xiao, et al.
Published: (2025)
by: Li, Xiao, et al.
Published: (2025)
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2024)
by: Tan, Shuai, et al.
Published: (2024)
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
by: Huang, Jiehui, et al.
Published: (2024)
by: Huang, Jiehui, et al.
Published: (2024)
LIA-X: Interpretable Latent Portrait Animator
by: Wang, Yaohui, et al.
Published: (2025)
by: Wang, Yaohui, et al.
Published: (2025)
DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis
by: Gu, Yuming, et al.
Published: (2023)
by: Gu, Yuming, et al.
Published: (2023)
EmoCAST: Emotional Talking Portrait via Emotive Text Description
by: Jiang, Yiguo, et al.
Published: (2025)
by: Jiang, Yiguo, et al.
Published: (2025)
RAW-Flow: Advancing RGB-to-RAW Image Reconstruction with Deterministic Latent Flow Matching
by: Liu, Zhen, et al.
Published: (2026)
by: Liu, Zhen, et al.
Published: (2026)
UniTalking: A Unified Audio-Video Framework for Talking Portrait Generation
by: Li, Hebeizi, et al.
Published: (2026)
by: Li, Hebeizi, et al.
Published: (2026)
MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control
by: Nazarieh, Fatemeh, et al.
Published: (2025)
by: Nazarieh, Fatemeh, et al.
Published: (2025)
PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation
by: Nazarieh, Fatemeh, et al.
Published: (2024)
by: Nazarieh, Fatemeh, et al.
Published: (2024)
MotionCharacter: Fine-Grained Motion Controllable Human Video Generation
by: Fang, Haopeng, et al.
Published: (2024)
by: Fang, Haopeng, et al.
Published: (2024)
Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting
by: Shi, Tong, et al.
Published: (2026)
by: Shi, Tong, et al.
Published: (2026)
Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement
by: Huang, Xin, et al.
Published: (2026)
by: Huang, Xin, et al.
Published: (2026)
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
by: Li, Tianqi, et al.
Published: (2024)
by: Li, Tianqi, et al.
Published: (2024)
FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis
by: Ni, Ziqi, et al.
Published: (2025)
by: Ni, Ziqi, et al.
Published: (2025)
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
by: Yang, Chaolong, et al.
Published: (2025)
by: Yang, Chaolong, et al.
Published: (2025)
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers
by: Fei, Zhengcong, et al.
Published: (2025)
by: Fei, Zhengcong, et al.
Published: (2025)
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
by: Meng, Ming, et al.
Published: (2024)
by: Meng, Ming, et al.
Published: (2024)
Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
by: Ma, Yue, et al.
Published: (2024)
by: Ma, Yue, et al.
Published: (2024)
Similar Items
-
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
by: Ki, Taekyung, et al.
Published: (2024) -
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
by: Wang, Mengchao, et al.
Published: (2025) -
DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
by: Shi, Yuxiang, et al.
Published: (2025) -
DEMO: A Statistical Perspective for Efficient Image-Text Matching
by: Zhang, Fan, et al.
Published: (2024) -
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
by: Ye, Zhenhui, et al.
Published: (2024)