| Main Authors: | Mu, Lingzhou, Liu, Baiji, Zhang, Ruonan, Mo, Guiming, Jin, Jiawei, Zhang, Kai, Huang, Haozhi |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2502.19455 |
Similar Items
RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer
by: Du, Fangyu, et al.
Published: (2025)
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
by: Wei, Huawei, et al.
Published: (2024)
Portrait Video Editing Empowered by Multimodal Generative Priors
by: Gao, Xuan, et al.
Published: (2024)
Real-time 3D-aware Portrait Video Relighting
by: Cai, Ziqi, et al.
Published: (2024)
MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars
by: Taubner, Felix, et al.
Published: (2025)
ExpPortrait: Expressive Portrait Generation via Personalized Representation
by: Wang, Junyi, et al.
Published: (2026)
Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer
by: Cui, Jiahao, et al.
Published: (2024)
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
by: Ji, Xiaozhong, et al.
Published: (2024)
In-Context Sync-LoRA for Portrait Video Editing
by: Polaczek, Sagi, et al.
Published: (2025)
EmoFace: Audio-driven Emotional 3D Face Animation
by: Liu, Chang, et al.
Published: (2024)
Managing level of detail through head-tracked peripheral degradation: a model and resulting design principles
by: Watson, Benjamin, et al.
Published: (2025)
Condition Matters in Full-head 3D GANs
by: Li, Heyuan, et al.
Published: (2026)
Hierarchical Vectorization for Portrait Images
by: Fu, Qian, et al.
Published: (2022)
Text-driven Talking Face Synthesis by Reprogramming Audio-driven Models
by: Choi, Jeongsoo, et al.
Published: (2023)
Bridging the gap between training and inference in LM-based TTS models
by: Zhang, Ruonan, et al.
Published: (2025)
3D-SSGAN: Lifting 2D Semantics for 3D-Aware Compositional Portrait Synthesis
by: Liu, Ruiqi, et al.
Published: (2024)
Lux Post Facto: Learning Portrait Performance Relighting with Conditional Video Diffusion and a Hybrid Dataset
by: Mei, Yiqun, et al.
Published: (2025)
Plug-and-Play PDE Optimization for 3D Gaussian Splatting: Toward High-Quality Rendering and Reconstruction
by: Mo, Yifan, et al.
Published: (2025)
ImagenHub: Standardizing the evaluation of conditional image generation models
by: Ku, Max, et al.
Published: (2023)
MotionDuet: Dual-Conditioned 3D Human Motion Generation with Video-Regularized Text Learning
by: Zhang, Yi-Yang, et al.
Published: (2025)
Audio is all in one: speech-driven gesture synthetics using WavLM pre-trained model
by: Zhang, Fan, et al.
Published: (2023)
Online Photon Guiding with 3D Gaussians for Caustics Rendering
by: Huang, Jiawei, et al.
Published: (2024)
VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models
by: Kim, Geonung, et al.
Published: (2025)
CharGen: Fast and Fluent Portrait Modification
by: Dihlmann, Jan-Niklas, et al.
Published: (2025)
Edge‐preserving noise for diffusion models
by: Jente Vandersanden, et al.
Published: (2026)
Unison: Harmonizing Motion, Speech, and Sound for Human-Centric Audio-Video Generation
by: Cheng, Shihao, et al.
Published: (2026)
SimEndoGS: Efficient Data-driven Scene Simulation using Robotic Surgery Videos via Physics-embedded 3D Gaussians
by: Yang, Zhenya, et al.
Published: (2024)
High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering
by: Ming, Xin, et al.
Published: (2024)
TransVDM: Motion-Constrained Video Diffusion Model for Transparent Video Synthesis
by: Li, Menghao, et al.
Published: (2025)
Audio2Face-3D: Audio-driven Realistic Facial Animation For Digital Avatars
by: NVIDIA, et al.
Published: (2025)
FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation
by: Zhang, Yunpeng, et al.
Published: (2025)
Before the Shutter: Aesthetic and Actionable Portrait Photography Planning in 3D Scenes
by: Jiang, Ruixiang, et al.
Published: (2026)
From Mannequin to Human: A Pose-Aware and Identity-Preserving Video Generation Framework for Lifelike Clothing Display
by: Mu, Xiangyu, et al.
Published: (2025)
Spatially and Temporally Optimized Audio‐Driven Talking Face Generation
by: Biao Dong, et al.
Published: (2024)
D3MAS: Decompose, Deduce, and Distribute for Enhanced Knowledge Sharing in Multi-Agent Systems
by: Zhang, Heng, et al.
Published: (2025)
RDC‐GS: Enhanced 3D Gaussian Splatting for Robust Dash Cam Video Reconstruction
by: Yunong Mao, et al.
Published: (2026)
Holographic Parallax Improves 3D Perceptual Realism
by: Kim, Dongyeon, et al.
Published: (2024)
Managing level of detail through peripheral degradation: Effects on search performance with a head-mounted display
by: Watson, Benjamin, et al.
Published: (2025)
PatternPortrait: Draw Me Like One of Your Scribbles
by: Wieluch, Sabine, et al.
Published: (2024)
Sketch-based Fluid Video Generation Using Motion-Guided Diffusion Models in Still Landscape Images
by: Jin, Hao, et al.
Published: (2025)