:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chen, Peiyin, Yang, Zhuowei, Feng, Hui, Jiang, Sheng, Yan, Rui
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.10650
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait
by: Ki, Taekyung, et al.
Published: (2024)

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
by: Wang, Mengchao, et al.
Published: (2025)

DeX-Portrait: Disentangled and Expressive Portrait Animation via Explicit and Latent Motion Representations
by: Shi, Yuxiang, et al.
Published: (2025)

DEMO: A Statistical Perspective for Efficient Image-Text Matching
by: Zhang, Fan, et al.
Published: (2024)

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
by: Ye, Zhenhui, et al.
Published: (2024)

CogPortrait: Fine-Grained Eye-Region Control in Portrait Animation via Hierarchical Agent Planning
by: Feng, He, et al.
Published: (2026)

FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint
by: Tang, Jiapeng, et al.
Published: (2025)

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025)

EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)

LVLMs as inspectors: an agentic framework for category-level structural defect annotation
by: Jiang, Sheng, et al.
Published: (2025)

PortraitDirector: A Hierarchical Disentanglement Framework for Controllable and Real-time Facial Reenactment
by: Ji, Chaonan, et al.
Published: (2026)

Toward Fine-Grained Facial Control in 3D Talking Head Generation
by: Xie, Shaoyang, et al.
Published: (2026)

CDST: Color Disentangled Style Transfer for Universal Style Reference Customization
by: Zhang, Shiwen, et al.
Published: (2025)

LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis
by: Li, Tianqi, et al.
Published: (2024)

HM-Talker: Hybrid Motion Modeling for High-Fidelity Talking Head Synthesis
by: Liu, Shiyu, et al.
Published: (2025)

Talk3D: High-Fidelity Talking Portrait Synthesis via Personalized 3D Generative Prior
by: Ko, Jaehoon, et al.
Published: (2024)

FineXtrol: Controllable Motion Generation via Fine-Grained Text
by: Shen, Keming, et al.
Published: (2025)

Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement
by: Yu, Runyi, et al.
Published: (2024)

I2VControl: Disentangled and Unified Video Motion Synthesis Control
by: Feng, Wanquan, et al.
Published: (2024)

IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait
by: Yang, Han, et al.
Published: (2025)

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs
by: Du, Yipeng, et al.
Published: (2025)

Bitrate-Controlled Diffusion for Disentangling Motion and Content in Video
by: Li, Xiao, et al.
Published: (2025)

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2024)

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
by: Huang, Jiehui, et al.
Published: (2024)

LIA-X: Interpretable Latent Portrait Animator
by: Wang, Yaohui, et al.
Published: (2025)

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis
by: Gu, Yuming, et al.
Published: (2023)

EmoCAST: Emotional Talking Portrait via Emotive Text Description
by: Jiang, Yiguo, et al.
Published: (2025)

RAW-Flow: Advancing RGB-to-RAW Image Reconstruction with Deterministic Latent Flow Matching
by: Liu, Zhen, et al.
Published: (2026)

UniTalking: A Unified Audio-Video Framework for Talking Portrait Generation
by: Li, Hebeizi, et al.
Published: (2026)

MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control
by: Nazarieh, Fatemeh, et al.
Published: (2025)

PortraitTalk: Towards Customizable One-Shot Audio-to-Talking Face Generation
by: Nazarieh, Fatemeh, et al.
Published: (2024)

MotionCharacter: Fine-Grained Motion Controllable Human Video Generation
by: Fang, Haopeng, et al.
Published: (2024)

Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting
by: Shi, Tong, et al.
Published: (2026)

Visually-Guided Controllable Medical Image Generation via Fine-Grained Semantic Disentanglement
by: Huang, Xin, et al.
Published: (2026)

Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
by: Li, Tianqi, et al.
Published: (2024)

FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis
by: Ni, Ziqi, et al.
Published: (2025)

Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait
by: Yang, Chaolong, et al.
Published: (2025)

SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion Transformers
by: Fei, Zhengcong, et al.
Published: (2025)

A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
by: Meng, Ming, et al.
Published: (2024)

Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation
by: Ma, Yue, et al.
Published: (2024)