Saved in:
| Main Authors: | Bradbury, Rowan, Zhong, Dazhi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.05198 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR
by: Fekri, Pedram, et al.
Published: (2026)
by: Fekri, Pedram, et al.
Published: (2026)
Single Mesh Diffusion Models with Field Latents for Texture Generation
by: Mitchel, Thomas W., et al.
Published: (2023)
by: Mitchel, Thomas W., et al.
Published: (2023)
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
by: Liu, Shengqi, et al.
Published: (2024)
by: Liu, Shengqi, et al.
Published: (2024)
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
by: Xing, Xiaoyan, et al.
Published: (2024)
by: Xing, Xiaoyan, et al.
Published: (2024)
MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data
by: Borne--Pons, Paul, et al.
Published: (2025)
by: Borne--Pons, Paul, et al.
Published: (2025)
Local Scale Equivariance with Latent Deep Equilibrium Canonicalizer
by: Rahman, Md Ashiqur, et al.
Published: (2025)
by: Rahman, Md Ashiqur, et al.
Published: (2025)
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
by: Ge, Songwei, et al.
Published: (2023)
by: Ge, Songwei, et al.
Published: (2023)
COLLAGE: Collaborative Human-Agent Interaction Generation using Hierarchical Latent Diffusion and Language Models
by: Daiya, Divyanshu, et al.
Published: (2024)
by: Daiya, Divyanshu, et al.
Published: (2024)
SALAD: Skeleton-aware Latent Diffusion for Text-driven Motion Generation and Editing
by: Hong, Seokhyeon, et al.
Published: (2025)
by: Hong, Seokhyeon, et al.
Published: (2025)
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
by: Guo, Yuwei, et al.
Published: (2023)
by: Guo, Yuwei, et al.
Published: (2023)
ODE-GS: Latent ODEs for Dynamic Scene Extrapolation with 3D Gaussian Splatting
by: Wang, Daniel, et al.
Published: (2025)
by: Wang, Daniel, et al.
Published: (2025)
An Object is Worth 64x64 Pixels: Generating 3D Object via Image Diffusion
by: Yan, Xingguang, et al.
Published: (2024)
by: Yan, Xingguang, et al.
Published: (2024)
Two Heads are Better than One: Geometric-Latent Attention for Point Cloud Classification and Segmentation
by: Cuevas-Velasquez, Hanz, et al.
Published: (2021)
by: Cuevas-Velasquez, Hanz, et al.
Published: (2021)
Click2Mask: Local Editing with Dynamic Mask Generation
by: Regev, Omer, et al.
Published: (2024)
by: Regev, Omer, et al.
Published: (2024)
Infinite Leagues Under the Sea: Photorealistic 3D Underwater Terrain Generation by Latent Fractal Diffusion Models
by: Zhang, Tianyi, et al.
Published: (2025)
by: Zhang, Tianyi, et al.
Published: (2025)
NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation
by: Zeng, Yu, et al.
Published: (2025)
by: Zeng, Yu, et al.
Published: (2025)
End-to-End Training for Unified Tokenization and Latent Denoising
by: Duggal, Shivam, et al.
Published: (2026)
by: Duggal, Shivam, et al.
Published: (2026)
RealOSR: Latent Guidance Boosts Diffusion-based Real-world Omnidirectional Image Super-Resolutions
by: Sheng, Xuhan, et al.
Published: (2024)
by: Sheng, Xuhan, et al.
Published: (2024)
Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor
by: Dagli, Rishit, et al.
Published: (2025)
by: Dagli, Rishit, et al.
Published: (2025)
Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild
by: Orzech, Nadav, et al.
Published: (2024)
by: Orzech, Nadav, et al.
Published: (2024)
TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation
by: Kalischek, Nikolai, et al.
Published: (2022)
by: Kalischek, Nikolai, et al.
Published: (2022)
Flexible Motion In-betweening with Diffusion Models
by: Cohan, Setareh, et al.
Published: (2024)
by: Cohan, Setareh, et al.
Published: (2024)
Distilling Diffusion Models into Conditional GANs
by: Kang, Minguk, et al.
Published: (2024)
by: Kang, Minguk, et al.
Published: (2024)
Interpreting the Weight Space of Customized Diffusion Models
by: Dravid, Amil, et al.
Published: (2024)
by: Dravid, Amil, et al.
Published: (2024)
Diff-3DCap: Shape Captioning with Diffusion Models
by: Shu, Zhenyu, et al.
Published: (2025)
by: Shu, Zhenyu, et al.
Published: (2025)
SliderSpace: Decomposing the Visual Capabilities of Diffusion Models
by: Gandikota, Rohit, et al.
Published: (2025)
by: Gandikota, Rohit, et al.
Published: (2025)
Curved Diffusion: A Generative Model With Optical Geometry Control
by: Voynov, Andrey, et al.
Published: (2023)
by: Voynov, Andrey, et al.
Published: (2023)
Enhancing Image Layout Control with Loss-Guided Diffusion Models
by: Patel, Zakaria, et al.
Published: (2024)
by: Patel, Zakaria, et al.
Published: (2024)
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
by: Avrahami, Omri, et al.
Published: (2023)
by: Avrahami, Omri, et al.
Published: (2023)
Adaptive Hybrid Caching for Efficient Text-to-Video Diffusion Model Acceleration
by: Wei, Yuanxin, et al.
Published: (2025)
by: Wei, Yuanxin, et al.
Published: (2025)
Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation
by: Liu, Haofeng, et al.
Published: (2024)
by: Liu, Haofeng, et al.
Published: (2024)
VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
by: Han, Junlin, et al.
Published: (2024)
by: Han, Junlin, et al.
Published: (2024)
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
by: Laria, Héctor, et al.
Published: (2025)
by: Laria, Héctor, et al.
Published: (2025)
DreamBlend: Advancing Personalized Fine-tuning of Text-to-Image Diffusion Models
by: Ram, Shwetha, et al.
Published: (2024)
by: Ram, Shwetha, et al.
Published: (2024)
CookingDiffusion: Cooking Procedural Image Generation with Stable Diffusion
by: Wang, Yuan, et al.
Published: (2025)
by: Wang, Yuan, et al.
Published: (2025)
Transparent Image Layer Diffusion using Latent Transparency
by: Zhang, Lvmin, et al.
Published: (2024)
by: Zhang, Lvmin, et al.
Published: (2024)
REED-VAE: RE-Encode Decode Training for Iterative Image Editing with Diffusion Models
by: Almog, Gal, et al.
Published: (2025)
by: Almog, Gal, et al.
Published: (2025)
L3DG: Latent 3D Gaussian Diffusion
by: Roessle, Barbara, et al.
Published: (2024)
by: Roessle, Barbara, et al.
Published: (2024)
TerraFusion: Joint Generation of Terrain Geometry and Texture Using Latent Diffusion Models
by: Higo, Kazuki, et al.
Published: (2025)
by: Higo, Kazuki, et al.
Published: (2025)
A General Implicit Framework for Fast NeRF Composition and Rendering
by: Gao, Xinyu, et al.
Published: (2023)
by: Gao, Xinyu, et al.
Published: (2023)
Similar Items
-
LatentHDR: Decoupling Exposure from Diffusion via Conditional Latent-to-Latent Mapping for Text/Image-to-Panoramic HDR
by: Fekri, Pedram, et al.
Published: (2026) -
Single Mesh Diffusion Models with Field Latents for Texture Generation
by: Mitchel, Thomas W., et al.
Published: (2023) -
Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
by: Liu, Shengqi, et al.
Published: (2024) -
LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting
by: Xing, Xiaoyan, et al.
Published: (2024) -
MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data
by: Borne--Pons, Paul, et al.
Published: (2025)