:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mu, Lingzhou, Liu, Baiji, Zhang, Ruonan, Mo, Guiming, Jin, Jiawei, Zhang, Kai, Huang, Haozhi
Format:	Preprint
Published:	2025
Subjects:	Graphics
Online Access:	https://arxiv.org/abs/2502.19455
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer
by: Du, Fangyu, et al.
Published: (2025)

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
by: Wei, Huawei, et al.
Published: (2024)

Portrait Video Editing Empowered by Multimodal Generative Priors
by: Gao, Xuan, et al.
Published: (2024)

Real-time 3D-aware Portrait Video Relighting
by: Cai, Ziqi, et al.
Published: (2024)

MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars
by: Taubner, Felix, et al.
Published: (2025)

ExpPortrait: Expressive Portrait Generation via Personalized Representation
by: Wang, Junyi, et al.
Published: (2026)

Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
by: Cui, Jiahao, et al.
Published: (2024)

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024)

In-Context Sync-LoRA for Portrait Video Editing
by: Polaczek, Sagi, et al.
Published: (2025)

EmoFace: Audio-driven Emotional 3D Face Animation
by: Liu, Chang, et al.
Published: (2024)

Managing level of detail through head-tracked peripheral degradation: a model and resulting design principles
by: Watson, Benjamin, et al.
Published: (2025)

Condition Matters in Full-head 3D GANs
by: Li, Heyuan, et al.
Published: (2026)

Hierarchical Vectorization for Portrait Images
by: Fu, Qian, et al.
Published: (2022)

Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
by: Choi, Jeongsoo, et al.
Published: (2023)

Bridging the gap between training and inference in LM-based TTS models
by: Zhang, Ruonan, et al.
Published: (2025)

3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis
by: Liu, Ruiqi, et al.
Published: (2024)

Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset
by: Mei, Yiqun, et al.
Published: (2025)

Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction
by: Mo, Yifan, et al.
Published: (2025)

ImagenHub: Standardizing the evaluation of conditional image generation models
by: Ku, Max, et al.
Published: (2023)

MotionDuet: Dual-Conditioned 3D Human Motion Generation with Video-Regularized Text Learning
by: Zhang, Yi-Yang, et al.
Published: (2025)

Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
by: Zhang, Fan, et al.
Published: (2023)

Online Photon Guiding with 3D Gaussians for Caustics Rendering
by: Huang, Jiawei, et al.
Published: (2024)

VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models
by: Kim, Geonung, et al.
Published: (2025)

CharGen: Fast and Fluent Portrait Modification
by: Dihlmann, Jan-Niklas, et al.
Published: (2025)

Edge‐preserving noise for diffusion models
by: Jente Vandersanden, et al.
Published: (2026)

Unison: Harmonizing Motion, Speech, and Sound for Human-Centric Audio-Video Generation
by: Cheng, Shihao, et al.
Published: (2026)

SimEndoGS: Efficient Data-driven Scene Simulation using Robotic Surgery Videos via Physics-embedded 3D Gaussians
by: Yang, Zhenya, et al.
Published: (2024)

High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering
by: Ming, Xin, et al.
Published: (2024)

TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis
by: Li, Menghao, et al.
Published: (2025)

Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars
by: NVIDIA, et al.
Published: (2025)

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
by: Zhang, Yunpeng, et al.
Published: (2025)

Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes
by: Jiang, Ruixiang, et al.
Published: (2026)

From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
by: Mu, Xiangyu, et al.
Published: (2025)

Spatially and Temporally Optimized Audio‐Driven Talking Face Generation
by: Biao Dong, et al.
Published: (2024)

D3MAS: Decompose, Deduce, and Distribute for Enhanced Knowledge Sharing in Multi-Agent Systems
by: Zhang, Heng, et al.
Published: (2025)

RDC‐GS: Enhanced 3D Gaussian Splatting for Robust Dash Cam Video Reconstruction
by: Yunong Mao, et al.
Published: (2026)

Holographic Parallax Improves 3D Perceptual Realism
by: Kim, Dongyeon, et al.
Published: (2024)

Managing level of detail through peripheral degradation: Effects on search performance with a head-mounted display
by: Watson, Benjamin, et al.
Published: (2025)

PatternPortrait: Draw Me Like One of Your Scribbles
by: Wieluch, Sabine, et al.
Published: (2024)

Sketch-based Fluid Video Generation Using Motion-Guided Diffusion Models in Still Landscape Images
by: Jin, Hao, et al.
Published: (2025)