:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Niu, Xuexiang, Tang, Jinping, Wang, Lei, Zhu, Ge
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.00122
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Bridging the Intention-Expression Gap: Aligning Multi-Dimensional Preferences via Hierarchical Relevance Feedback in Text-to-Image Diffusion
by: Wang, Wenxi, et al.
Published: (2026)

Category-level Text-to-Image Retrieval Improved: Bridging the Domain Gap with Diffusion Models and Vision Encoders
by: Khan, Faizan Farooq, et al.
Published: (2025)

Asynchronous Denoising Diffusion Models for Aligning Text-to-Image Generation
by: Hu, Zijing, et al.
Published: (2025)

Can Diffusion Models Bridge the Domain Gap in Cardiac MR Imaging?
by: Wong, Xin Ci, et al.
Published: (2025)

TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models
by: Xiao, Yao, et al.
Published: (2025)

Fair Text to Medical Image Diffusion Model with Subgroup Distribution Aligned Tuning
by: Han, Xu, et al.
Published: (2024)

GrOCE:Graph-Guided Online Concept Erasure for Text-to-Image Diffusion Models
by: Han, Ning, et al.
Published: (2025)

StyleBlend: Enhancing Style-Specific Content Creation in Text-to-Image Diffusion Models
by: Chen, Zichong, et al.
Published: (2025)

DSG-World: Learning a 3D Gaussian World Model from Dual State Videos
by: Hu, Wenhao, et al.
Published: (2025)

CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
by: Jiang, Dongzhi, et al.
Published: (2024)

Text-Anchored Score Composition: Tackling Condition Misalignment in Text-to-Image Diffusion Models
by: Wang, Luozhou, et al.
Published: (2023)

Energy-oriented Diffusion Bridge for Image Restoration with Foundational Diffusion Models
by: Hou, Jinhui, et al.
Published: (2026)

Bridging the Perception Gap in Image Super-Resolution Evaluation
by: Su, Shaolin, et al.
Published: (2025)

Safeguard Text-to-Image Diffusion Models with Human Feedback Inversion
by: Kim, Sanghyun, et al.
Published: (2024)

Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets
by: Kennerley, Mikhail, et al.
Published: (2025)

Text-Printed Image: Bridging the Image-Text Modality Gap for Text-centric Training of Large Vision-Language Models
by: Yamabe, Shojiro, et al.
Published: (2025)

A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
by: Yang, Shentao, et al.
Published: (2024)

FreeInit: Bridging Initialization Gap in Video Diffusion Models
by: Wu, Tianxing, et al.
Published: (2023)

Bridging the RGB-IR Gap: Consensus and Discrepancy Modeling for Text-Guided Multispectral Detection
by: Wu, Jiaqi, et al.
Published: (2026)

Noise Projection: Closing the Prompt-Agnostic Gap Behind Text-to-Image Misalignment in Diffusion Models
by: Tong, Yunze, et al.
Published: (2025)

Residual Diffusion Bridge Model for Image Restoration
by: Wang, Hebaixu, et al.
Published: (2025)

PC-Diffusion: Aligning Diffusion Models with Human Preferences via Preference Classifier
by: Wang, Shaomeng, et al.
Published: (2025)

Security Tensors as a Cross-Modal Bridge: Extending Text-Aligned Safety to Vision in LVLM
by: Li, Shen, et al.
Published: (2025)

Mind the Gap: Aligning Vision Foundation Models to Image Feature Matching
by: Liu, Yuhan, et al.
Published: (2025)

LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation
by: Wang, Xuqin, et al.
Published: (2025)

PrefPaint: Aligning Image Inpainting Diffusion Model with Human Preference
by: Liu, Kendong, et al.
Published: (2024)

ControlNet-XS: Rethinking the Control of Text-to-Image Diffusion Models as Feedback-Control Systems
by: Zavadski, Denis, et al.
Published: (2023)

Bridging the Skill Gap in Clinical CBCT Interpretation with CBCTRepD
by: Wu, Qinxin, et al.
Published: (2026)

Aligning Text-to-Image Diffusion Models with Reward Backpropagation
by: Prabhudesai, Mihir, et al.
Published: (2023)

Post-training Quantization for Text-to-Image Diffusion Models with Progressive Calibration and Activation Relaxing
by: Tang, Siao, et al.
Published: (2023)

Disciplined Diffusion: Text-to-Image Diffusion Model against NSFW Generation
by: Zhang, Chi, et al.
Published: (2026)

Filter & Align: Leveraging Human Knowledge to Curate Image-Text Data
by: Zhang, Lei, et al.
Published: (2023)

Palette Aligned Image Diffusion
by: Aharoni, Elad, et al.
Published: (2025)

Bridging the Pose-Semantic Gap: A Cascade Framework for Text-Based Person Anomaly Search
by: Xie, Zequn, et al.
Published: (2026)

Learning on Less: Constraining Pre-trained Model Learning for Generalizable Diffusion-Generated Image Detection
by: Chen, Yingjian, et al.
Published: (2024)

Auditing Data Provenance in Real-world Text-to-Image Diffusion Models for Privacy and Copyright Protection
by: Zhu, Jie, et al.
Published: (2025)

Scaling Down Text Encoders of Text-to-Image Diffusion Models
by: Wang, Lifu, et al.
Published: (2025)

AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)

Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
by: Chen, Xinyan, et al.
Published: (2023)

Bridging Visual Affective Gap: Borrowing Textual Knowledge by Learning from Noisy Image-Text Pairs
by: Wu, Daiqing, et al.
Published: (2025)