:: Library Catalog

Bibliographic Details
Main Authors:	Ma, Xiaohe; Deschaintre, Valentin; Hašan, Miloš; Luan, Fujun; Zhou, Kun; Wu, Hongzhi; Hu, Yiwei
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.03225

Similar Items

RealMat: Realistic Materials with Diffusion and Reinforcement Learning
by: Zhou, Xilong, et al.
Published: (2025)

HiMat: DiT-based Ultra-High Resolution SVBRDF Generation
by: Wang, Zixiong, et al.
Published: (2025)

RGB$\leftrightarrow$X: Image decomposition and synthesis using material- and lighting-aware diffusion models
by: Zeng, Zheng, et al.
Published: (2024)

Uncertainty for SVBRDF Acquisition using Frequency Analysis
by: Wiersma, Ruben, et al.
Published: (2024)

MatSynth: A Modern PBR Materials Dataset
by: Vecchio, Giuseppe, et al.
Published: (2024)

DiVE: DiT-based Video Generation with Enhanced Control
by: Jiang, Junpeng, et al.
Published: (2024)

Generating 360° Video is What You Need For a 3D Scene
by: Zhang, Zhaoyang, et al.
Published: (2025)

TexSliders: Diffusion-Based Texture Editing in CLIP Space
by: Guerrero-Viu, Julia, et al.
Published: (2024)

RNA: Relightable Neural Assets
by: Mullia, Krishna, et al.
Published: (2023)

ControlMat: A Controlled Generative Approach to Material Capture
by: Vecchio, Giuseppe, et al.
Published: (2023)

RNG: Relightable Neural Gaussians
by: Fan, Jiahui, et al.
Published: (2024)

Neural Product Importance Sampling via Warp Composition
by: Litalien, Joey, et al.
Published: (2024)

MiM-DiT: MoE in MoE with Diffusion Transformers for All-in-One Image Restoration
by: Kong, Lingshun, et al.
Published: (2026)

SAMa: Material-aware 3D Selection and Segmentation
by: Fischer, Michael, et al.
Published: (2024)

Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising
by: Fang, Gongfan, et al.
Published: (2024)

MeshLRM: Large Reconstruction Model for High-Quality Meshes
by: Wei, Xinyue, et al.
Published: (2024)

S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation
by: Zhao, Lin, et al.
Published: (2026)

Fine-Grained Spatially Varying Material Selection in Images
by: Guerrero-Viu, Julia, et al.
Published: (2025)

GenMask: Adapting DiT for Segmentation via Direct Mask Generation
by: Yang, Yuhuan, et al.
Published: (2026)

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT
by: Liu, Dongyang, et al.
Published: (2025)

DiT4Edit: Diffusion Transformer for Image Editing
by: Feng, Kunyu, et al.
Published: (2024)

Buffer Anytime: Zero-Shot Video Depth and Normal from Image Priors
by: Kuang, Zhengfei, et al.
Published: (2024)

DiTalker: A Unified DiT-based Framework for High-Quality and Speaking Styles Controllable Portrait Animation
by: Feng, He, et al.
Published: (2025)

VideoMatGen: PBR Materials through Joint Generative Modeling
by: Hasselgren, Jon, et al.
Published: (2026)

RelitLRM: Generative Relightable Radiance for Large Reconstruction Models
by: Zhang, Tianyuan, et al.
Published: (2024)

Mask$^2$DiT: Dual Mask-based Diffusion Transformer for Multi-Scene Long Video Generation
by: Qi, Tianhao, et al.
Published: (2025)

PTQ4DiT: Post-training Quantization for Diffusion Transformers
by: Wu, Junyi, et al.
Published: (2024)

XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation
by: Chen, Bowen, et al.
Published: (2025)

MatAtlas: Text-driven Consistent Geometry Texturing and Material Assignment
by: Ceylan, Duygu, et al.
Published: (2024)

IntrinsicEdit: Precise generative image manipulation in intrinsic space
by: Lyu, Linjie, et al.
Published: (2025)

Mamoda2.5: Enhancing Unified Multimodal Model with DiT-MoE
by: Shi, Yangming, et al.
Published: (2026)

U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers
by: Tian, Yuchuan, et al.
Published: (2024)

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation
by: Chen, Chen, et al.
Published: (2025)

LaVin-DiT: Large Vision Diffusion Transformer
by: Wang, Zhaoqing, et al.
Published: (2024)

Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
by: Chen, Lei, et al.
Published: (2024)

Paying U-Attention to Textures: Multi-Stage Hourglass Vision Transformer for Universal Texture Synthesis
by: Guo, Shouchang, et al.
Published: (2022)

DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression
by: Shi, Junqi, et al.
Published: (2026)

DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training
by: Feng, Haoran, et al.
Published: (2025)

3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering
by: Zhou, Dewei, et al.
Published: (2025)

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation
by: Yang, Lianwei, et al.
Published: (2025)