Saved in:
| Main Authors: | Ma, Xiaohe, Deschaintre, Valentin, Hašan, Miloš, Luan, Fujun, Zhou, Kun, Wu, Hongzhi, Hu, Yiwei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.03225 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RealMat: Realistic Materials with Diffusion and Reinforcement Learning
by: Zhou, Xilong, et al.
Published: (2025)
by: Zhou, Xilong, et al.
Published: (2025)
HiMat: DiT-based Ultra-High Resolution SVBRDF Generation
by: Wang, Zixiong, et al.
Published: (2025)
by: Wang, Zixiong, et al.
Published: (2025)
RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models
by: Zeng, Zheng, et al.
Published: (2024)
by: Zeng, Zheng, et al.
Published: (2024)
Uncertainty for SVBRDF Acquisition using Frequency Analysis
by: Wiersma, Ruben, et al.
Published: (2024)
by: Wiersma, Ruben, et al.
Published: (2024)
MatSynth: A Modern PBR Materials Dataset
by: Vecchio, Giuseppe, et al.
Published: (2024)
by: Vecchio, Giuseppe, et al.
Published: (2024)
DiVE: DiT-based Video Generation with Enhanced Control
by: Jiang, Junpeng, et al.
Published: (2024)
by: Jiang, Junpeng, et al.
Published: (2024)
Generating 360° Video is What You Need For a 3D Scene
by: Zhang, Zhaoyang, et al.
Published: (2025)
by: Zhang, Zhaoyang, et al.
Published: (2025)
TexSliders: Diffusion-Based Texture Editing in CLIP Space
by: Guerrero-Viu, Julia, et al.
Published: (2024)
by: Guerrero-Viu, Julia, et al.
Published: (2024)
RNA: Relightable Neural Assets
by: Mullia, Krishna, et al.
Published: (2023)
by: Mullia, Krishna, et al.
Published: (2023)
ControlMat: A Controlled Generative Approach to Material Capture
by: Vecchio, Giuseppe, et al.
Published: (2023)
by: Vecchio, Giuseppe, et al.
Published: (2023)
RNG: Relightable Neural Gaussians
by: Fan, Jiahui, et al.
Published: (2024)
by: Fan, Jiahui, et al.
Published: (2024)
Neural Product Importance Sampling via Warp Composition
by: Litalien, Joey, et al.
Published: (2024)
by: Litalien, Joey, et al.
Published: (2024)
MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration
by: Kong, Lingshun, et al.
Published: (2026)
by: Kong, Lingshun, et al.
Published: (2026)
SAMa: Material-aware 3D Selection and Segmentation
by: Fischer, Michael, et al.
Published: (2024)
by: Fischer, Michael, et al.
Published: (2024)
Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising
by: Fang, Gongfan, et al.
Published: (2024)
by: Fang, Gongfan, et al.
Published: (2024)
MeshLRM: Large Reconstruction Model for High-Quality Meshes
by: Wei, Xinyue, et al.
Published: (2024)
by: Wei, Xinyue, et al.
Published: (2024)
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation
by: Zhao, Lin, et al.
Published: (2026)
by: Zhao, Lin, et al.
Published: (2026)
Fine-Grained Spatially Varying Material Selection in Images
by: Guerrero-Viu, Julia, et al.
Published: (2025)
by: Guerrero-Viu, Julia, et al.
Published: (2025)
GenMask: Adapting DiT for Segmentation via Direct Mask Generation
by: Yang, Yuhuan, et al.
Published: (2026)
by: Yang, Yuhuan, et al.
Published: (2026)
Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT
by: Liu, Dongyang, et al.
Published: (2025)
by: Liu, Dongyang, et al.
Published: (2025)
DiT4Edit: Diffusion Transformer for Image Editing
by: Feng, Kunyu, et al.
Published: (2024)
by: Feng, Kunyu, et al.
Published: (2024)
Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors
by: Kuang, Zhengfei, et al.
Published: (2024)
by: Kuang, Zhengfei, et al.
Published: (2024)
DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation
by: Feng, He, et al.
Published: (2025)
by: Feng, He, et al.
Published: (2025)
VideoMatGen: PBR Materials through Joint Generative Modeling
by: Hasselgren, Jon, et al.
Published: (2026)
by: Hasselgren, Jon, et al.
Published: (2026)
RelitLRM: Generative Relightable Radiance for Large Reconstruction Models
by: Zhang, Tianyuan, et al.
Published: (2024)
by: Zhang, Tianyuan, et al.
Published: (2024)
Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation
by: Qi, Tianhao, et al.
Published: (2025)
by: Qi, Tianhao, et al.
Published: (2025)
PTQ4DiT: Post-training Quantization for Diffusion Transformers
by: Wu, Junyi, et al.
Published: (2024)
by: Wu, Junyi, et al.
Published: (2024)
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation
by: Chen, Bowen, et al.
Published: (2025)
by: Chen, Bowen, et al.
Published: (2025)
MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment
by: Ceylan, Duygu, et al.
Published: (2024)
by: Ceylan, Duygu, et al.
Published: (2024)
IntrinsicEdit: Precise generative image manipulation in intrinsic space
by: Lyu, Linjie, et al.
Published: (2025)
by: Lyu, Linjie, et al.
Published: (2025)
Mamoda2.5: Enhancing Unified Multimodal Model with DiT-MoE
by: Shi, Yangming, et al.
Published: (2026)
by: Shi, Yangming, et al.
Published: (2026)
U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
by: Tian, Yuchuan, et al.
Published: (2024)
by: Tian, Yuchuan, et al.
Published: (2024)
DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
by: Chen, Chen, et al.
Published: (2025)
by: Chen, Chen, et al.
Published: (2025)
LaVin-DiT: Large Vision Diffusion Transformer
by: Wang, Zhaoqing, et al.
Published: (2024)
by: Wang, Zhaoqing, et al.
Published: (2024)
Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
by: Chen, Lei, et al.
Published: (2024)
by: Chen, Lei, et al.
Published: (2024)
Paying U-Attention to Textures: Multi-Stage Hourglass Vision Transformer for Universal Texture Synthesis
by: Guo, Shouchang, et al.
Published: (2022)
by: Guo, Shouchang, et al.
Published: (2022)
DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression
by: Shi, Junqi, et al.
Published: (2026)
by: Shi, Junqi, et al.
Published: (2026)
DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
by: Feng, Haoran, et al.
Published: (2025)
by: Feng, Haoran, et al.
Published: (2025)
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering
by: Zhou, Dewei, et al.
Published: (2025)
by: Zhou, Dewei, et al.
Published: (2025)
LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation
by: Yang, Lianwei, et al.
Published: (2025)
by: Yang, Lianwei, et al.
Published: (2025)
Similar Items
-
RealMat: Realistic Materials with Diffusion and Reinforcement Learning
by: Zhou, Xilong, et al.
Published: (2025) -
HiMat: DiT-based Ultra-High Resolution SVBRDF Generation
by: Wang, Zixiong, et al.
Published: (2025) -
RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models
by: Zeng, Zheng, et al.
Published: (2024) -
Uncertainty for SVBRDF Acquisition using Frequency Analysis
by: Wiersma, Ruben, et al.
Published: (2024) -
MatSynth: A Modern PBR Materials Dataset
by: Vecchio, Giuseppe, et al.
Published: (2024)