:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	R., Mallikarjun B., Yin, Fei, Voleti, Vikram, Drobyshev, Nikita, Lapin, Maksim, Vasishta, Aaryaman, Jampani, Varun
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2509.17476
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement
von: Boss, Mark, et al.
Veröffentlicht: (2024)

Stable Virtual Camera: Generative View Synthesis with Diffusion Models
von: Zhou, Jensen, et al.
Veröffentlicht: (2025)

FROMAT: Multiview Material Appearance Transfer via Few-Shot Self-Attention Adaptation
von: Kompanowski, Hubert, et al.
Veröffentlicht: (2025)

SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images
von: Huang, Zixuan, et al.
Veröffentlicht: (2025)

SV4D 2.0: Enhancing Spatio-Temporal Consistency in Multi-View Video Diffusion for High-Quality 4D Generation
von: Yao, Chun-Han, et al.
Veröffentlicht: (2025)

HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model
von: Nguyen, Hieu T., et al.
Veröffentlicht: (2024)

SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency
von: Xie, Yiming, et al.
Veröffentlicht: (2024)

FaceCraft4D: Animated 3D Facial Avatar Generation from a Single Image
von: Yin, Fei, et al.
Veröffentlicht: (2025)

Stable Cinemetrics : Structured Taxonomy and Evaluation for Professional Video Generation
von: Chatterjee, Agneet, et al.
Veröffentlicht: (2025)

SViM3D: Stable Video Material Diffusion for Single Image 3D Generation
von: Engelhardt, Andreas, et al.
Veröffentlicht: (2025)

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
von: Voleti, Vikram, et al.
Veröffentlicht: (2024)

HumANDiff: Articulated Noise Diffusion for Motion-Consistent Human Video Generation
von: Hu, Tao, et al.
Veröffentlicht: (2026)

Human Video Generation from a Single Image with 3D Pose and View Control
von: Wang, Tiantian, et al.
Veröffentlicht: (2026)

Stable Part Diffusion 4D: Multi-View RGB and Kinematic Parts Video Generation
von: Zhang, Hao, et al.
Veröffentlicht: (2025)

Stable Video Portraits
von: Ostrek, Mirela, et al.
Veröffentlicht: (2024)

Computational Tradeoffs in Image Synthesis: Diffusion, Masked-Token, and Next-Token Prediction
von: Kilian, Maciej, et al.
Veröffentlicht: (2024)

FlashLips: 100-FPS Mask-Free Latent Lip-Sync using Reconstruction Instead of Diffusion or GANs
von: Zinonos, Andreas, et al.
Veröffentlicht: (2025)

Unified Dense Prediction of Video Diffusion
von: Yang, Lehan, et al.
Veröffentlicht: (2025)

EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
von: Drobyshev, Nikita, et al.
Veröffentlicht: (2024)

HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models
von: Peng, Xiaogang, et al.
Veröffentlicht: (2023)

MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation
von: Hu, Hanzhe, et al.
Veröffentlicht: (2024)

Foley Control: Aligning a Frozen Latent Text-to-Audio Model to Video
von: Rowles, Ciara, et al.
Veröffentlicht: (2025)

Learning Action and Reasoning-Centric Image Editing from Videos and Simulations
von: Krojer, Benno, et al.
Veröffentlicht: (2024)

MARBLE: Material Recomposition and Blending in CLIP-Space
von: Cheng, Ta-Ying, et al.
Veröffentlicht: (2025)

Lite2Relight: 3D-aware Single Image Portrait Relighting
von: Rao, Pramod, et al.
Veröffentlicht: (2024)

PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling
von: Zhang, Hao, et al.
Veröffentlicht: (2025)

Block Cascading: Training Free Acceleration of Block-Causal Video Models
von: Bandyopadhyay, Hmrishav, et al.
Veröffentlicht: (2025)

ZeST: Zero-Shot Material Transfer from a Single Image
von: Cheng, Ta-Ying, et al.
Veröffentlicht: (2024)

ZeroShape: Regression-based Zero-shot Shape Reconstruction
von: Huang, Zixuan, et al.
Veröffentlicht: (2023)

ReSWD: ReSTIR'd, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction
von: Boss, Mark, et al.
Veröffentlicht: (2025)

SMooDi: Stylized Motion Diffusion Model
von: Zhong, Lei, et al.
Veröffentlicht: (2024)

WordRobe: Text-Guided Generation of Textured 3D Garments
von: Srivastava, Astitva, et al.
Veröffentlicht: (2024)

OmniControl: Control Any Joint at Any Time for Human Motion Generation
von: Xie, Yiming, et al.
Veröffentlicht: (2023)

3DPR: Single Image 3D Portrait Relight using Generative Priors
von: Rao, Pramod, et al.
Veröffentlicht: (2025)

KeyFace: Expressive Audio-Driven Facial Animation for Long Sequences via KeyFrame Interpolation
von: Bigata, Antoni, et al.
Veröffentlicht: (2025)

FlowPortrait: Reinforcement Learning for Audio-Driven Portrait Video Generation
von: Tan, Weiting, et al.
Veröffentlicht: (2026)

FactorPortrait: Controllable Portrait Animation via Disentangled Expression, Pose, and Viewpoint
von: Tang, Jiapeng, et al.
Veröffentlicht: (2025)

GMTalker: Gaussian Mixture-based Audio-Driven Emotional Talking Video Portraits
von: Xia, Yibo, et al.
Veröffentlicht: (2023)

Not all Views are Created Equal: Analyzing Viewpoint Instabilities in Vision Foundation Models
von: Michalkiewicz, Mateusz, et al.
Veröffentlicht: (2024)

PS-StyleGAN: Illustrative Portrait Sketching using Attention-Based Style Adaptation
von: Jain, Kushal Kumar, et al.
Veröffentlicht: (2024)