Library Catalog

Bibliographic Details
Main Authors:	Bradbury, Rowan; Zhong, Dazhi
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Graphics; Machine Learning
Online Access:	https://arxiv.org/abs/2512.05198

Similar Items

LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR
by: Fekri, Pedram, et al.
Published: (2026)

Single Mesh Diffusion Models with Field Latents for Texture Generation
by: Mitchel, Thomas W., et al.
Published: (2023)

Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
by: Liu, Shengqi, et al.
Published: (2024)

LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
by: Xing, Xiaoyan, et al.
Published: (2024)

MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data
by: Borne-Pons, Paul, et al.
Published: (2025)

Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer
by: Rahman, Md Ashiqur, et al.
Published: (2025)

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
by: Ge, Songwei, et al.
Published: (2023)

COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models
by: Daiya, Divyanshu, et al.
Published: (2024)

SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
by: Hong, Seokhyeon, et al.
Published: (2025)

AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
by: Guo, Yuwei, et al.
Published: (2023)

ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting
by: Wang, Daniel, et al.
Published: (2025)

An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion
by: Yan, Xingguang, et al.
Published: (2024)

Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation
by: Cuevas-Velasquez, Hanz, et al.
Published: (2021)

Click2Mask: Local Editing with Dynamic Mask Generation
by: Regev, Omer, et al.
Published: (2024)

Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models
by: Zhang, Tianyi, et al.
Published: (2025)

NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
by: Zeng, Yu, et al.
Published: (2025)

End-to-End Training for Unified Tokenization and Latent Denoising
by: Duggal, Shivam, et al.
Published: (2026)

RealOSR: Latent Guidance Boosts Diffusion-based Real-world Omnidirectional Image Super-Resolutions
by: Sheng, Xuhan, et al.
Published: (2024)

Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor
by: Dagli, Rishit, et al.
Published: (2025)

Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild
by: Orzech, Nadav, et al.
Published: (2024)

TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation
by: Kalischek, Nikolai, et al.
Published: (2022)

Flexible Motion In-betweening with Diffusion Models
by: Cohan, Setareh, et al.
Published: (2024)

Distilling Diffusion Models into Conditional GANs
by: Kang, Minguk, et al.
Published: (2024)

Interpreting the Weight Space of Customized Diffusion Models
by: Dravid, Amil, et al.
Published: (2024)

Diff-3DCap: Shape Captioning with Diffusion Models
by: Shu, Zhenyu, et al.
Published: (2025)

SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
by: Gandikota, Rohit, et al.
Published: (2025)

Curved Diffusion: A Generative Model With Optical Geometry Control
by: Voynov, Andrey, et al.
Published: (2023)

Enhancing Image Layout Control with Loss-Guided Diffusion Models
by: Patel, Zakaria, et al.
Published: (2024)

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
by: Avrahami, Omri, et al.
Published: (2023)

Adaptive Hybrid Caching for Efficient Text-to-Video Diffusion Model Acceleration
by: Wei, Yuanxin, et al.
Published: (2025)

Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
by: Liu, Haofeng, et al.
Published: (2024)

VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
by: Han, Junlin, et al.
Published: (2024)

Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
by: Laria, Héctor, et al.
Published: (2025)

DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
by: Ram, Shwetha, et al.
Published: (2024)

CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion
by: Wang, Yuan, et al.
Published: (2025)

Transparent Image Layer Diffusion using Latent Transparency
by: Zhang, Lvmin, et al.
Published: (2024)

REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
by: Almog, Gal, et al.
Published: (2025)

L3DG: Latent 3D Gaussian Diffusion
by: Roessle, Barbara, et al.
Published: (2024)

TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models
by: Higo, Kazuki, et al.
Published: (2025)

A General Implicit Framework for Fast NeRF Composition and Rendering
by: Gao, Xinyu, et al.
Published: (2023)