:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tan, Weipeng, Lin, Chuming, Xu, Chengming, Ji, Xiaozhong, Zhu, Junwei, Wang, Chengjie, Wu, Yunsheng, Fu, Yanwei
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2409.03270
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025)

RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
by: Ji, Xiaozhong, et al.
Published: (2024)

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024)

VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
by: Wang, Qilin, et al.
Published: (2024)

ArtWeaver: Advanced Dynamic Style Integration via Diffusion Model
by: Xu, Chengming, et al.
Published: (2024)

HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
by: Xu, Zunnan, et al.
Published: (2025)

Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style
by: Tan, Shuai, et al.
Published: (2024)

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
by: Jiang, Boyuan, et al.
Published: (2024)

MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
by: Yang, Shurong, et al.
Published: (2024)

When Preferences Diverge: Aligning Diffusion Models with Minority-Aware Adaptive DPO
by: Zhang, Lingfan, et al.
Published: (2025)

Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting
by: Shi, Tong, et al.
Published: (2026)

IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos
by: Li, Yuan, et al.
Published: (2025)

MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
by: Lin, Yukang, et al.
Published: (2025)

StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
by: Wang, Suzhen, et al.
Published: (2024)

VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models
by: Xiaobin, Hu, et al.
Published: (2025)

EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)

UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
by: He, Qingdong, et al.
Published: (2024)

SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment
by: Sun, Yanxiao, et al.
Published: (2025)

TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles
by: Ma, Yifeng, et al.
Published: (2023)

Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation
by: Tan, Jintao, et al.
Published: (2024)

A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
by: Meng, Ming, et al.
Published: (2024)

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
by: Huang, Xijie, et al.
Published: (2026)

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
by: Luo, Donghao, et al.
Published: (2025)

FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases
by: Tan, Shuai, et al.
Published: (2025)

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2024)

Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)

PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision
by: Wang, Chengjie, et al.
Published: (2024)

StrandDesigner: Towards Practical Strand Generation with Sketch Guidance
by: Zhang, Na, et al.
Published: (2025)

T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image
by: Zhang, Shijie, et al.
Published: (2024)

Towards Global Optimal Visual In-Context Learning Prompt Selection
by: Xu, Chengming, et al.
Published: (2024)

Domain Generalizable Portrait Style Transfer
by: Wang, Xinbo, et al.
Published: (2025)

A Generalization Theory of Cross-Modality Distillation with Contrastive Learning
by: Lin, Hangyu, et al.
Published: (2024)

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
by: Wang, Mengchao, et al.
Published: (2025)

Robust Network Learning via Inverse Scale Variational Sparsification
by: Zhou, Zhiling, et al.
Published: (2024)

SVFR: A Unified Framework for Generalized Video Face Restoration
by: Wang, Zhiyao, et al.
Published: (2025)

TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
by: Liu, Yufei, et al.
Published: (2024)

EFCNet: Every Feature Counts for Small Medical Object Segmentation
by: Kong, Lingjie, et al.
Published: (2024)

DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models
by: Ma, Yifeng, et al.
Published: (2023)

NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
by: Xie, Yu, et al.
Published: (2025)

DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation
by: Wang, Qilin, et al.
Published: (2024)