Saved in:
| Main Authors: | Tan, Weipeng, Lin, Chuming, Xu, Chengming, Ji, Xiaozhong, Zhu, Junwei, Wang, Chengjie, Wu, Yunsheng, Fu, Yanwei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.03270 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025)
by: Tan, Weipeng, et al.
Published: (2025)
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
by: Ji, Xiaozhong, et al.
Published: (2024)
by: Ji, Xiaozhong, et al.
Published: (2024)
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024)
by: Ji, Xiaozhong, et al.
Published: (2024)
VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
by: Wang, Qilin, et al.
Published: (2024)
by: Wang, Qilin, et al.
Published: (2024)
ArtWeaver: Advanced Dynamic Style Integration via Diffusion Model
by: Xu, Chengming, et al.
Published: (2024)
by: Xu, Chengming, et al.
Published: (2024)
HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
by: Xu, Zunnan, et al.
Published: (2025)
by: Xu, Zunnan, et al.
Published: (2025)
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style
by: Tan, Shuai, et al.
Published: (2024)
by: Tan, Shuai, et al.
Published: (2024)
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
by: Jiang, Boyuan, et al.
Published: (2024)
by: Jiang, Boyuan, et al.
Published: (2024)
MegActor: Harness the Power of Raw Video for Vivid Portrait Animation
by: Yang, Shurong, et al.
Published: (2024)
by: Yang, Shurong, et al.
Published: (2024)
When Preferences Diverge: Aligning Diffusion Models with Minority-Aware Adaptive DPO
by: Zhang, Lingfan, et al.
Published: (2025)
by: Zhang, Lingfan, et al.
Published: (2025)
Splat-Portrait: Generalizing Talking Heads with Gaussian Splatting
by: Shi, Tong, et al.
Published: (2026)
by: Shi, Tong, et al.
Published: (2026)
IM-Portrait: Learning 3D-aware Video Diffusion for Photorealistic Talking Heads from Monocular Videos
by: Li, Yuan, et al.
Published: (2025)
by: Li, Yuan, et al.
Published: (2025)
MVPortrait: Text-Guided Motion and Emotion Control for Multi-view Vivid Portrait Animation
by: Lin, Yukang, et al.
Published: (2025)
by: Lin, Yukang, et al.
Published: (2025)
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
by: Wang, Suzhen, et al.
Published: (2024)
by: Wang, Suzhen, et al.
Published: (2024)
VTBench: Comprehensive Benchmark Suite Towards Real-World Virtual Try-on Models
by: Xiaobin, Hu, et al.
Published: (2025)
by: Xiaobin, Hu, et al.
Published: (2025)
EDTalk++: Full Disentanglement for Controllable Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2025)
by: Tan, Shuai, et al.
Published: (2025)
UniM-OV3D: Uni-Modality Open-Vocabulary 3D Scene Understanding with Fine-Grained Feature Representation
by: He, Qingdong, et al.
Published: (2024)
by: He, Qingdong, et al.
Published: (2024)
SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment
by: Sun, Yanxiao, et al.
Published: (2025)
by: Sun, Yanxiao, et al.
Published: (2025)
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles
by: Ma, Yifeng, et al.
Published: (2023)
by: Ma, Yifeng, et al.
Published: (2023)
Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation
by: Tan, Jintao, et al.
Published: (2024)
by: Tan, Jintao, et al.
Published: (2024)
A Comprehensive Taxonomy and Analysis of Talking Head Synthesis: Techniques for Portrait Generation, Driving Mechanisms, and Editing
by: Meng, Ming, et al.
Published: (2024)
by: Meng, Ming, et al.
Published: (2024)
FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing
by: Huang, Xijie, et al.
Published: (2026)
by: Huang, Xijie, et al.
Published: (2026)
CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
by: Luo, Donghao, et al.
Published: (2025)
by: Luo, Donghao, et al.
Published: (2025)
FixTalk: Taming Identity Leakage for High-Quality Talking Head Generation in Extreme Cases
by: Tan, Shuai, et al.
Published: (2025)
by: Tan, Shuai, et al.
Published: (2025)
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
by: Tan, Shuai, et al.
Published: (2024)
by: Tan, Shuai, et al.
Published: (2024)
Vivid-ZOO: Multi-View Video Generation with Diffusion Model
by: Li, Bing, et al.
Published: (2024)
by: Li, Bing, et al.
Published: (2024)
PSPU: Enhanced Positive and Unlabeled Learning by Leveraging Pseudo Supervision
by: Wang, Chengjie, et al.
Published: (2024)
by: Wang, Chengjie, et al.
Published: (2024)
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance
by: Zhang, Na, et al.
Published: (2025)
by: Zhang, Na, et al.
Published: (2025)
T-Pixel2Mesh: Combining Global and Local Transformer for 3D Mesh Generation from a Single Image
by: Zhang, Shijie, et al.
Published: (2024)
by: Zhang, Shijie, et al.
Published: (2024)
Towards Global Optimal Visual In-Context Learning Prompt Selection
by: Xu, Chengming, et al.
Published: (2024)
by: Xu, Chengming, et al.
Published: (2024)
Domain Generalizable Portrait Style Transfer
by: Wang, Xinbo, et al.
Published: (2025)
by: Wang, Xinbo, et al.
Published: (2025)
A Generalization Theory of Cross-Modality Distillation with Contrastive Learning
by: Lin, Hangyu, et al.
Published: (2024)
by: Lin, Hangyu, et al.
Published: (2024)
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
by: Wang, Mengchao, et al.
Published: (2025)
by: Wang, Mengchao, et al.
Published: (2025)
Robust Network Learning via Inverse Scale Variational Sparsification
by: Zhou, Zhiling, et al.
Published: (2024)
by: Zhou, Zhiling, et al.
Published: (2024)
SVFR: A Unified Framework for Generalized Video Face Restoration
by: Wang, Zhiyao, et al.
Published: (2025)
by: Wang, Zhiyao, et al.
Published: (2025)
TexDreamer: Towards Zero-Shot High-Fidelity 3D Human Texture Generation
by: Liu, Yufei, et al.
Published: (2024)
by: Liu, Yufei, et al.
Published: (2024)
EFCNet: Every Feature Counts for Small Medical Object Segmentation
by: Kong, Lingjie, et al.
Published: (2024)
by: Kong, Lingjie, et al.
Published: (2024)
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models
by: Ma, Yifeng, et al.
Published: (2023)
by: Ma, Yifeng, et al.
Published: (2023)
NSFW-Classifier Guided Prompt Sanitization for Safe Text-to-Image Generation
by: Xie, Yu, et al.
Published: (2025)
by: Xie, Yu, et al.
Published: (2025)
DiffFAE: Advancing High-fidelity One-shot Facial Appearance Editing with Space-sensitive Customization and Semantic Preservation
by: Wang, Qilin, et al.
Published: (2024)
by: Wang, Qilin, et al.
Published: (2024)
Similar Items
-
Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation
by: Tan, Weipeng, et al.
Published: (2025) -
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D Facial Prior-guided Identity Alignment Network
by: Ji, Xiaozhong, et al.
Published: (2024) -
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024) -
VividPose: Advancing Stable Video Diffusion for Realistic Human Image Animation
by: Wang, Qilin, et al.
Published: (2024) -
ArtWeaver: Advanced Dynamic Style Integration via Diffusion Model
by: Xu, Chengming, et al.
Published: (2024)