Saved in:
| Main Author: | Kong, Fei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.13387 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Compressed Image Generation with Denoising Diffusion Codebook Models
by: Ohayon, Guy, et al.
Published: (2025)
by: Ohayon, Guy, et al.
Published: (2025)
Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images
by: Di Via, Roberto, et al.
Published: (2024)
by: Di Via, Roberto, et al.
Published: (2024)
Turbo-DDCM: Fast and Flexible Zero-Shot Diffusion-Based Image Compression
by: Vaisman, Amit, et al.
Published: (2025)
by: Vaisman, Amit, et al.
Published: (2025)
StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN
by: Bedychaj, Andrzej, et al.
Published: (2024)
by: Bedychaj, Andrzej, et al.
Published: (2024)
SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation
by: Chen, Zisheng, et al.
Published: (2025)
by: Chen, Zisheng, et al.
Published: (2025)
Denoising Task Routing for Diffusion Models
by: Park, Byeongjun, et al.
Published: (2023)
by: Park, Byeongjun, et al.
Published: (2023)
CyberHost: Taming Audio-driven Avatar Diffusion Model with Region Codebook Attention
by: Lin, Gaojie, et al.
Published: (2024)
by: Lin, Gaojie, et al.
Published: (2024)
Generic Event Boundary Detection via Denoising Diffusion
by: Hwang, Jaejun, et al.
Published: (2025)
by: Hwang, Jaejun, et al.
Published: (2025)
UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model
by: Zhuang, Shaobin, et al.
Published: (2026)
by: Zhuang, Shaobin, et al.
Published: (2026)
Improving fine-grained understanding in image-text pre-training
by: Bica, Ioana, et al.
Published: (2024)
by: Bica, Ioana, et al.
Published: (2024)
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
by: Chen, Zigeng, et al.
Published: (2024)
by: Chen, Zigeng, et al.
Published: (2024)
Intention-aware Denoising Diffusion Model for Trajectory Prediction
by: Liu, Chen, et al.
Published: (2024)
by: Liu, Chen, et al.
Published: (2024)
DiTraj: training-free trajectory control for video diffusion transformer
by: Lei, Cheng, et al.
Published: (2025)
by: Lei, Cheng, et al.
Published: (2025)
LiDAR Point Cloud Image-based Generation Using Denoising Diffusion Probabilistic Models
by: Aghanouri, Amirhesam, et al.
Published: (2025)
by: Aghanouri, Amirhesam, et al.
Published: (2025)
Counterfactual MRI Data Augmentation using Conditional Denoising Diffusion Generative Models
by: Morão, Pedro, et al.
Published: (2024)
by: Morão, Pedro, et al.
Published: (2024)
Unsupervised Region-Based Image Editing of Denoising Diffusion Models
by: Li, Zixiang, et al.
Published: (2024)
by: Li, Zixiang, et al.
Published: (2024)
Revisiting MAE pre-training for 3D medical image segmentation
by: Wald, Tassilo, et al.
Published: (2024)
by: Wald, Tassilo, et al.
Published: (2024)
Generative Pre-trained Autoregressive Diffusion Transformer
by: Zhang, Yuan, et al.
Published: (2025)
by: Zhang, Yuan, et al.
Published: (2025)
GRIF-DM: Generation of Rich Impression Fonts using Diffusion Models
by: Kang, Lei, et al.
Published: (2024)
by: Kang, Lei, et al.
Published: (2024)
Denoising Diffusion as a New Framework for Underwater Images
by: Jain, Nilesh, et al.
Published: (2025)
by: Jain, Nilesh, et al.
Published: (2025)
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation
by: Yu, Lijun, et al.
Published: (2023)
by: Yu, Lijun, et al.
Published: (2023)
FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models
by: Bulat, Adrian, et al.
Published: (2024)
by: Bulat, Adrian, et al.
Published: (2024)
Distilling Multi-view Diffusion Models into 3D Generators
by: Qin, Hao, et al.
Published: (2025)
by: Qin, Hao, et al.
Published: (2025)
DiffSSC: Semantic LiDAR Scan Completion using Denoising Diffusion Probabilistic Models
by: Cao, Helin, et al.
Published: (2024)
by: Cao, Helin, et al.
Published: (2024)
Discriminative Class Tokens for Text-to-Image Diffusion Models
by: Schwartz, Idan, et al.
Published: (2023)
by: Schwartz, Idan, et al.
Published: (2023)
Simultaneous Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2026)
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2026)
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models
by: Feng, Tongtong, et al.
Published: (2024)
by: Feng, Tongtong, et al.
Published: (2024)
Complete Gaussian Splats from a Single Image with Denoising Diffusion Models
by: Liao, Ziwei, et al.
Published: (2025)
by: Liao, Ziwei, et al.
Published: (2025)
Point-RTD: Replaced Token Denoising for Pretraining Transformer Models on Point Clouds
by: Stone, Gunner, et al.
Published: (2025)
by: Stone, Gunner, et al.
Published: (2025)
QVD: Post-training Quantization for Video Diffusion Models
by: Tian, Shilong, et al.
Published: (2024)
by: Tian, Shilong, et al.
Published: (2024)
MammoRGB: Dual-View Mammogram Synthesis Using Denoising Diffusion Probabilistic Models
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2025)
by: Garza-Abdala, Jorge Alberto, et al.
Published: (2025)
Resfusion: Denoising Diffusion Probabilistic Models for Image Restoration Based on Prior Residual Noise
by: Shi, Zhenning, et al.
Published: (2023)
by: Shi, Zhenning, et al.
Published: (2023)
UniCode: Learning a Unified Codebook for Multimodal Large Language Models
by: Zheng, Sipeng, et al.
Published: (2024)
by: Zheng, Sipeng, et al.
Published: (2024)
Data Extrapolation for Text-to-image Generation on Small Datasets
by: Ye, Senmao, et al.
Published: (2024)
by: Ye, Senmao, et al.
Published: (2024)
DiffFinger: Advancing Synthetic Fingerprint Generation through Denoising Diffusion Probabilistic Models
by: Grabovski, Freddie, et al.
Published: (2024)
by: Grabovski, Freddie, et al.
Published: (2024)
Remix-DiT: Mixing Diffusion Transformers for Multi-Expert Denoising
by: Fang, Gongfan, et al.
Published: (2024)
by: Fang, Gongfan, et al.
Published: (2024)
G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models
by: Zhang, Tianjiao, et al.
Published: (2025)
by: Zhang, Tianjiao, et al.
Published: (2025)
Vector Grimoire: Codebook-based Shape Generation under Raster Image Supervision
by: Feuerpfeil, Moritz, et al.
Published: (2024)
by: Feuerpfeil, Moritz, et al.
Published: (2024)
Objective and Interpretable Breast Cosmesis Evaluation with Attention Guided Denoising Diffusion Anomaly Detection Model
by: Park, Sangjoon, et al.
Published: (2024)
by: Park, Sangjoon, et al.
Published: (2024)
On Improved Conditioning Mechanisms and Pre-training Strategies for Diffusion Models
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)
by: Ifriqi, Tariq Berrada, et al.
Published: (2024)
Similar Items
-
Compressed Image Generation with Denoising Diffusion Codebook Models
by: Ohayon, Guy, et al.
Published: (2025) -
Self-supervised pre-training with diffusion model for few-shot landmark detection in x-ray images
by: Di Via, Roberto, et al.
Published: (2024) -
Turbo-DDCM: Fast and Flexible Zero-Shot Diffusion-Based Image Compression
by: Vaisman, Amit, et al.
Published: (2025) -
StyleAutoEncoder for manipulating image attributes using pre-trained StyleGAN
by: Bedychaj, Andrzej, et al.
Published: (2024) -
SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation
by: Chen, Zisheng, et al.
Published: (2025)