| Main Authors: | Niu, Xuexiang, Tang, Jinping, Wang, Lei, Zhu, Ge |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.00122 |
Similar Items
Bridging the Intention-Expression Gap: Aligning Multi-Dimensional Preferences via Hierarchical Relevance Feedback in Text-to-Image Diffusion
by: Wang, Wenxi, et al.
Published: (2026)
Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025)
Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation
by: Hu, Zijing, et al.
Published: (2025)
Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging?
by: Wong, Xin Ci, et al.
Published: (2025)
TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models
by: Xiao, Yao, et al.
Published: (2025)
Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning
by: Han, Xu, et al.
Published: (2024)
GrOCE: Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models
by: Han, Ning, et al.
Published: (2025)
StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models
by: Chen, Zichong, et al.
Published: (2025)
DSG-World: Learning a 3D Gaussian World Model from Dual State Videos
by: Hu, Wenhao, et al.
Published: (2025)
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
by: Jiang, Dongzhi, et al.
Published: (2024)
Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
by: Wang, Luozhou, et al.
Published: (2023)
Energy-oriented Diffusion Bridge for Image Restoration with Foundational Diffusion Models
by: Hou, Jinhui, et al.
Published: (2026)
Bridging the Perception Gap in Image Super-Resolution Evaluation
by: Su, Shaolin, et al.
Published: (2025)
Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
by: Kim, Sanghyun, et al.
Published: (2024)
Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets
by: Kennerley, Mikhail, et al.
Published: (2025)
Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
by: Yamabe, Shojiro, et al.
Published: (2025)
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
by: Yang, Shentao, et al.
Published: (2024)
FreeInit: Bridging Initialization Gap in Video Diffusion Models
by: Wu, Tianxing, et al.
Published: (2023)
Bridging the RGB-IR Gap: Consensus and Discrepancy Modeling for Text-Guided Multispectral Detection
by: Wu, Jiaqi, et al.
Published: (2026)
Noise Projection: Closing the Prompt-Agnostic Gap Behind Text-to-Image Misalignment in Diffusion Models
by: Tong, Yunze, et al.
Published: (2025)
Residual Diffusion Bridge Model for Image Restoration
by: Wang, Hebaixu, et al.
Published: (2025)
PC-Diffusion: Aligning Diffusion Models with Human Preferences via Preference Classifier
by: Wang, Shaomeng, et al.
Published: (2025)
Security Tensors as a Cross-Modal Bridge: Extending Text-Aligned Safety to Vision in LVLM
by: Li, Shen, et al.
Published: (2025)
Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching
by: Liu, Yuhan, et al.
Published: (2025)
LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation
by: Wang, Xuqin, et al.
Published: (2025)
PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
by: Liu, Kendong, et al.
Published: (2024)
ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems
by: Zavadski, Denis, et al.
Published: (2023)
Bridging the Skill Gap in Clinical CBCT Interpretation with CBCTRepD
by: Wu, Qinxin, et al.
Published: (2026)
Aligning Text-to-Image Diffusion Models with Reward Backpropagation
by: Prabhudesai, Mihir, et al.
Published: (2023)
Post-training Quantization for Text-to-Image Diffusion Models with Progressive Calibration and Activation Relaxing
by: Tang, Siao, et al.
Published: (2023)
Disciplined Diffusion: Text-to-Image Diffusion Model against NSFW Generation
by: Zhang, Chi, et al.
Published: (2026)
Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data
by: Zhang, Lei, et al.
Published: (2023)
Palette Aligned Image Diffusion
by: Aharoni, Elad, et al.
Published: (2025)
Bridging the Pose-Semantic Gap: A Cascade Framework for Text-Based Person Anomaly Search
by: Xie, Zequn, et al.
Published: (2026)
Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection
by: Chen, Yingjian, et al.
Published: (2024)
Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection
by: Zhu, Jie, et al.
Published: (2025)
Scaling Down Text Encoders of Text-to-Image Diffusion Models
by: Wang, Lifu, et al.
Published: (2025)
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
by: Chen, Xinyan, et al.
Published: (2023)
Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs
by: Wu, Daiqing, et al.
Published: (2025)