:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jiang, Jing, Ling, Yiran, Li, Binzhu, Li, Pengxiang, Piao, Junming, Zhang, Yu
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.06196
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Pixels, Patterns, but No Poetry: To See The World like Humans
by: Gao, Hongcheng, et al.
Published: (2025)

Single Image Iterative Subject-driven Generation and Editing
by: Shpitzer, Yair, et al.
Published: (2025)

Seeing the Poem: Image-Semantic Detection of AI-Generated Modern Chinese Poetry with MLLMs
by: Wang, Shanshan, et al.
Published: (2026)

Iterative Refinement Improves Compositional Image Generation
by: Jaiswal, Shantanu, et al.
Published: (2026)

Iterative Adversarial Attack on Image-guided Story Ending Generation
by: Wang, Youze, et al.
Published: (2023)

Autoregressive Image Generation with Linear Complexity: A Spatial-Aware Decay Perspective
by: Mao, Yuxin, et al.
Published: (2025)

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image
by: Zhao, Yu, et al.
Published: (2024)

DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging
by: Cheng, Huimin, et al.
Published: (2025)

Regeneration Based Training-free Attribution of Fake Images Generated by Text-to-Image Generative Models
by: Li, Meiling, et al.
Published: (2024)

ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models
by: Zhou, Qin, et al.
Published: (2025)

Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection
by: Jiang, Yuchu, et al.
Published: (2025)

Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning
by: You, Xiaoxing, et al.
Published: (2025)

StableI2I: Spotting Unintended Changes in Image-to-Image Transition
by: Li, Jiayang, et al.
Published: (2026)

GenShield: Unified Detection and Artifact Correction for AI-Generated Images
by: Xu, Zhipei, et al.
Published: (2026)

A Framework For Image Synthesis Using Supervised Contrastive Learning
by: Liu, Yibin, et al.
Published: (2024)

CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)

Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement
by: Jeong, Suchae, et al.
Published: (2025)

Condition-Aware Neural Network for Controlled Image Generation
by: Cai, Han, et al.
Published: (2024)

Self-Corrected Image Generation with Explainable Latent Rewards
by: Luo, Yinyi, et al.
Published: (2026)

Relative-Absolute Fusion: Rethinking Feature Extraction in Image-Based Iterative Method Selection for Solving Sparse Linear Systems
by: Zhang, Kaiqi, et al.
Published: (2025)

Multi-Agent Image Restoration
by: Jiang, Xu, et al.
Published: (2025)

ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning
by: Zhao, Yiran, et al.
Published: (2026)

Improving Generalization of Medical Image Registration Foundation Model
by: Hu, Jing, et al.
Published: (2025)

Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis
by: Jiang, Yankai, et al.
Published: (2025)

GEBench: Benchmarking Image Generation Models as GUI Environments
by: Li, Haodong, et al.
Published: (2026)

REVEAL: Reasoning-Enhanced Forensic Evidence Analysis for Explainable AI-Generated Image Detection
by: Cao, Huangsen, et al.
Published: (2025)

Cross Modality Image Translation In Medical Imaging Using Generative Frameworks
by: Romoli, Giulia, et al.
Published: (2026)

IA-T2I: Internet-Augmented Text-to-Image Generation
by: Li, Chuanhao, et al.
Published: (2025)

Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models
by: Jamil, Sofia, et al.
Published: (2025)

Culture-inspired Multi-modal Color Palette Generation and Colorization: A Chinese Youth Subculture Case
by: Li, Yufan, et al.
Published: (2021)

Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
by: Cho, Jaemin, et al.
Published: (2023)

RL-I2IT: Image-to-Image Translation with Deep Reinforcement Learning
by: Hu, Jing, et al.
Published: (2023)

VLM-Guided Iterative Refinement for Surgical Image Segmentation with Foundation Models
by: Lou, Ange, et al.
Published: (2026)

A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
by: Feng, Yukang, et al.
Published: (2025)

MS-UMamba: An Improved Vision Mamba Unet for Fetal Abdominal Medical Image Segmentation
by: Xu, Caixu, et al.
Published: (2025)

Spatial-Aware Latent Initialization for Controllable Image Generation
by: Sun, Wenqiang, et al.
Published: (2024)

VModA: An Effective Framework for Adaptive NSFW Image Moderation
by: Bao, Han, et al.
Published: (2025)

Heterogeneous Generative Knowledge Distillation with Masked Image Modeling
by: Wang, Ziming, et al.
Published: (2023)

WithAnyone: Towards Controllable and ID Consistent Image Generation
by: Xu, Hengyuan, et al.
Published: (2025)

Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
by: Zhang, Bowen, et al.
Published: (2024)