:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	van Peursen, Willem Th., Entsua-Mensah, Samuel E.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.02973
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Seeing The Words: Evaluating AI-generated Biblical Art
by: Makimei, Hidde, et al.
Published: (2025)

Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
by: Li, You, et al.
Published: (2024)

A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Disease Detection from Retinal Fundus Images
by: Djoumessi, Kerol, et al.
Published: (2025)

Imagine yourself: Tuning-Free Personalized Image Generation
by: He, Zecheng, et al.
Published: (2024)

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
by: Li, Hongyu, et al.
Published: (2024)

Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention
by: Cho, Wonwoong, et al.
Published: (2025)

Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything
by: Zou, Xiaotian, et al.
Published: (2024)

Exploring Bias in over 100 Text-to-Image Generative Models
by: Vice, Jordan, et al.
Published: (2025)

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
by: Lyu, Yueming, et al.
Published: (2023)

Exploring Text-Guided Single Image Editing for Remote Sensing Images
by: Han, Fangzhou, et al.
Published: (2024)

SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning
by: Li, Yian, et al.
Published: (2026)

Imagining the Unseen: Generative Location Modeling for Object Placement
by: Yun, Jooyeol, et al.
Published: (2024)

ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis
by: Sun, Zhengwentai, et al.
Published: (2026)

Surgical Text-to-Image Generation
by: Nwoye, Chinedu Innocent, et al.
Published: (2024)

Towards Interpretable Foundation Models for Retinal Fundus Images
by: Mensah, Samuel Ofosu, et al.
Published: (2026)

Color Bind: Exploring Color Perception in Text-to-Image Models
by: Shomer-Chai, Shay, et al.
Published: (2025)

TextBoost: Boosting Text Encoder for Personalized Text-to-Image Generation
by: Park, NaHyeon, et al.
Published: (2024)

OmniText: A Training-Free Generalist for Controllable Text-Image Manipulation
by: Gunawan, Agus, et al.
Published: (2025)

Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation
by: Lei, Sen, et al.
Published: (2024)

SeqBench: Benchmarking Sequential Narrative Generation in Text-to-Video Models
by: Tang, Zhengxu, et al.
Published: (2025)

Text4Seg: Reimagining Image Segmentation as Text Generation
by: Lan, Mengcheng, et al.
Published: (2024)

Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking
by: Xuan, Shiyu, et al.
Published: (2025)

Prompt Refinement with Image Pivot for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)

Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
by: Zheng, Matthew, et al.
Published: (2024)

A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images
by: Sordo, Zineb, et al.
Published: (2025)

DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior
by: Zhang, Yiming, et al.
Published: (2024)

Imagine360: Immersive 360 Video Generation from Perspective Anchor
by: Tan, Jing, et al.
Published: (2024)

Exploring Motion-Language Alignment for Text-driven Motion Generation
by: Gu, Ruxi, et al.
Published: (2026)

Generating Intermediate Representations for Compositional Text-To-Image Generation
by: Galun, Ran, et al.
Published: (2024)

Exploring the Naturalness of AI-Generated Images
by: Chen, Zijian, et al.
Published: (2023)

Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation
by: Zhou, Yucheng, et al.
Published: (2025)

Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models
by: Samuel, Dvir, et al.
Published: (2023)

TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation
by: Ozaki, Shintaro, et al.
Published: (2025)

Generating Multimodal Images with GAN: Integrating Text, Image, and Style
by: Tan, Chaoyi, et al.
Published: (2025)

Text to Image Generation and Editing: A Survey
by: Yang, Pengfei, et al.
Published: (2025)

Resolving the Identity Crisis in Text-to-Image Generation
by: Borse, Shubhankar, et al.
Published: (2025)

Conditional Text-to-Image Generation with Reference Guidance
by: Kim, Taewook, et al.
Published: (2024)

Text-Image Conditioned 3D Generation
by: Cen, Jiazhong, et al.
Published: (2026)

Personalizing Text-to-Image Generation to Individual Taste
by: Maerten, Anne-Sofie, et al.
Published: (2026)

Rich Human Feedback for Text-to-Image Generation
by: Liang, Youwei, et al.
Published: (2023)