Saved in:
| Main Authors: | Wang, Xi, He, Ziqi, Zhou, Yang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.03471 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MIFO: Learning and Synthesizing Multi-Instance from One Image
by: Su, Kailun, et al.
Published: (2025)
by: Su, Kailun, et al.
Published: (2025)
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
by: Zhou, Feng, et al.
Published: (2025)
by: Zhou, Feng, et al.
Published: (2025)
Enhancing Text-to-Image Generation via End-Edge Collaborative Hybrid Super-Resolution
by: Yi, Chongbin, et al.
Published: (2026)
by: Yi, Chongbin, et al.
Published: (2026)
PCA-Enhanced Probabilistic U-Net for Effective Ambiguous Medical Image Segmentation
by: Li, Xiangyu, et al.
Published: (2026)
by: Li, Xiangyu, et al.
Published: (2026)
Object Fidelity Diffusion for Remote Sensing Image Generation
by: Ye, Ziqi, et al.
Published: (2025)
by: Ye, Ziqi, et al.
Published: (2025)
CasDyF-Net: Image Dehazing via Cascaded Dynamic Filters
by: Yinglong, Wang, et al.
Published: (2024)
by: Yinglong, Wang, et al.
Published: (2024)
Noise Diffusion for Enhancing Semantic Faithfulness in Text-to-Image Synthesis
by: Miao, Boming, et al.
Published: (2024)
by: Miao, Boming, et al.
Published: (2024)
U-REPA: Aligning Diffusion U-Nets to ViTs
by: Tian, Yuchuan, et al.
Published: (2025)
by: Tian, Yuchuan, et al.
Published: (2025)
DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis
by: Luo, Xinglong, et al.
Published: (2026)
by: Luo, Xinglong, et al.
Published: (2026)
Speedrunning ImageNet Diffusion
by: Bhanded, Swayam
Published: (2025)
by: Bhanded, Swayam
Published: (2025)
StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models
by: Chen, Zichong, et al.
Published: (2025)
by: Chen, Zichong, et al.
Published: (2025)
TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image Super-Resolution
by: Liu, Baolin, et al.
Published: (2023)
by: Liu, Baolin, et al.
Published: (2023)
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models
by: Yang, Hongji, et al.
Published: (2025)
by: Yang, Hongji, et al.
Published: (2025)
Semantic Image Synthesis via Diffusion Models
by: Zhou, Wengang, et al.
Published: (2022)
by: Zhou, Wengang, et al.
Published: (2022)
Enhancing Semantic Fidelity in Text-to-Image Synthesis: Attention Regulation in Diffusion Models
by: Zhang, Yang, et al.
Published: (2024)
by: Zhang, Yang, et al.
Published: (2024)
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps
by: Palaev, Andrey, et al.
Published: (2025)
by: Palaev, Andrey, et al.
Published: (2025)
Enhancing Feature Fusion of U-like Networks with Dynamic Skip Connections
by: Cao, Yue, et al.
Published: (2025)
by: Cao, Yue, et al.
Published: (2025)
Enhance Image Classification via Inter-Class Image Mixup with Diffusion Model
by: Wang, Zhicai, et al.
Published: (2024)
by: Wang, Zhicai, et al.
Published: (2024)
FREAK: Frequency-modulated High-fidelity and Real-time Audio-driven Talking Portrait Synthesis
by: Ni, Ziqi, et al.
Published: (2025)
by: Ni, Ziqi, et al.
Published: (2025)
Adaptively Distilled ControlNet: Accelerated Training and Superior Sampling for Medical Image Synthesis
by: Qiu, Kunpeng, et al.
Published: (2025)
by: Qiu, Kunpeng, et al.
Published: (2025)
PairingNet: A Learning-based Pair-searching and -matching Network for Image Fragments
by: Zhou, Rixin, et al.
Published: (2023)
by: Zhou, Rixin, et al.
Published: (2023)
TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models
by: Zhou, Teng, et al.
Published: (2024)
by: Zhou, Teng, et al.
Published: (2024)
FilterPrompt: A Simple yet Efficient Approach to Guide Image Appearance Transfer in Diffusion Models
by: Wang, Xi, et al.
Published: (2024)
by: Wang, Xi, et al.
Published: (2024)
Ultra-High-Resolution Image Synthesis with Pyramid Diffusion Model
by: Yang, Jiajie
Published: (2024)
by: Yang, Jiajie
Published: (2024)
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis
by: Shao, Shitong, et al.
Published: (2024)
by: Shao, Shitong, et al.
Published: (2024)
Enhancing Object Coherence in Layout-to-Image Synthesis
by: Wang, Yibin, et al.
Published: (2023)
by: Wang, Yibin, et al.
Published: (2023)
Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation
by: Qiu, Kunpeng, et al.
Published: (2025)
by: Qiu, Kunpeng, et al.
Published: (2025)
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation
by: Zhou, Jun, et al.
Published: (2024)
by: Zhou, Jun, et al.
Published: (2024)
PBE-UNet: A light weight Progressive Boundary-Enhanced U-Net with Scale-Aware Aggregation for Ultrasound Image Segmentation
by: Wang, Chen, et al.
Published: (2026)
by: Wang, Chen, et al.
Published: (2026)
Masked Conditional Diffusion Model for Enhancing Deepfake Detection
by: Chen, Tiewen, et al.
Published: (2024)
by: Chen, Tiewen, et al.
Published: (2024)
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
by: Shen, Fei, et al.
Published: (2023)
by: Shen, Fei, et al.
Published: (2023)
ReVersion: Diffusion-Based Relation Inversion from Images
by: Huang, Ziqi, et al.
Published: (2023)
by: Huang, Ziqi, et al.
Published: (2023)
Guided Image Synthesis via Initial Image Editing in Diffusion Model
by: Mao, Jiafeng, et al.
Published: (2023)
by: Mao, Jiafeng, et al.
Published: (2023)
UniConvNet: Expanding Effective Receptive Field while Maintaining Asymptotically Gaussian Distribution for ConvNets of Any Scale
by: Wang, Yuhao, et al.
Published: (2025)
by: Wang, Yuhao, et al.
Published: (2025)
DEPICT: Diffusion-Enabled Permutation Importance for Image Classification Tasks
by: Jabbour, Sarah, et al.
Published: (2024)
by: Jabbour, Sarah, et al.
Published: (2024)
DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
by: Ai, Yuang, et al.
Published: (2025)
by: Ai, Yuang, et al.
Published: (2025)
Latent Diffusion U-Net Representations Contain Positional Embeddings and Anomalies
by: Loos, Jonas, et al.
Published: (2025)
by: Loos, Jonas, et al.
Published: (2025)
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided Diffusion
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Low-light Image Enhancement via CLIP-Fourier Guided Wavelet Diffusion
by: Xue, Minglong, et al.
Published: (2024)
by: Xue, Minglong, et al.
Published: (2024)
AI-T2I: Aggregating-and-Isolating Cross-Attention to Diffusion Models for Text-to-Image Synthesis
by: Cao, Shipeng, et al.
Published: (2026)
by: Cao, Shipeng, et al.
Published: (2026)
Similar Items
-
MIFO: Learning and Synthesizing Multi-Instance from One Image
by: Su, Kailun, et al.
Published: (2025) -
Exploring Position Encoding in Diffusion U-Net for Training-free High-resolution Image Generation
by: Zhou, Feng, et al.
Published: (2025) -
Enhancing Text-to-Image Generation via End-Edge Collaborative Hybrid Super-Resolution
by: Yi, Chongbin, et al.
Published: (2026) -
PCA-Enhanced Probabilistic U-Net for Effective Ambiguous Medical Image Segmentation
by: Li, Xiangyu, et al.
Published: (2026) -
Object Fidelity Diffusion for Remote Sensing Image Generation
by: Ye, Ziqi, et al.
Published: (2025)