| Main Authors: | van Peursen, Willem Th., Entsua-Mensah, Samuel E. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.02973 |
Similar Items
Seeing The Words: Evaluating AI-generated Biblical Art
by: Makimei, Hidde, et al.
Published: (2025)
Imagine and Seek: Improving Composed Image Retrieval with an Imagined Proxy
by: Li, You, et al.
Published: (2024)
A Hybrid Fully Convolutional CNN-Transformer Model for Inherently Interpretable Disease Detection from Retinal Fundus Images
by: Djoumessi, Kerol, et al.
Published: (2025)
Imagine yourself: Tuning-Free Personalized Image Generation
by: He, Zecheng, et al.
Published: (2024)
Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
by: Li, Hongyu, et al.
Published: (2024)
Imagine for Me: Creative Conceptual Blending of Real Images and Text via Blended Attention
by: Cho, Wonwoong, et al.
Published: (2025)
Image-to-Text Logic Jailbreak: Your Imagination can Help You Do Anything
by: Zou, Xiaotian, et al.
Published: (2024)
Exploring Bias in over 100 Text-to-Image Generative Models
by: Vice, Jordan, et al.
Published: (2025)
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
by: Lyu, Yueming, et al.
Published: (2023)
Exploring Text-Guided Single Image Editing for Remote Sensing Images
by: Han, Fangzhou, et al.
Published: (2024)
SpatialImaginer: Towards Adaptive Visual Imagination for Spatial Reasoning
by: Li, Yian, et al.
Published: (2026)
Imagining the Unseen: Generative Location Modeling for Object Placement
by: Yun, Jooyeol, et al.
Published: (2024)
ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis
by: Sun, Zhengwentai, et al.
Published: (2026)
Surgical Text-to-Image Generation
by: Nwoye, Chinedu Innocent, et al.
Published: (2024)
Towards Interpretable Foundation Models for Retinal Fundus Images
by: Mensah, Samuel Ofosu, et al.
Published: (2026)
Color Bind: Exploring Color Perception in Text-to-Image Models
by: Shomer-Chai, Shay, et al.
Published: (2025)
TextBoost: Boosting Text Encoder for Personalized Text-to-Image Generation
by: Park, NaHyeon, et al.
Published: (2024)
OmniText: A Training-Free Generalist for Controllable Text-Image Manipulation
by: Gunawan, Agus, et al.
Published: (2025)
Exploring Fine-Grained Image-Text Alignment for Referring Remote Sensing Image Segmentation
by: Lei, Sen, et al.
Published: (2024)
SeqBench: Benchmarking Sequential Narrative Generation in Text-to-Video Models
by: Tang, Zhengxu, et al.
Published: (2025)
Text4Seg: Reimagining Image Segmentation as Text Generation
by: Lan, Mengcheng, et al.
Published: (2024)
Diff-MM: Exploring Pre-trained Text-to-Image Generation Model for Unified Multi-modal Object Tracking
by: Xuan, Shiyu, et al.
Published: (2025)
Prompt Refinement with Image Pivot for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models
by: Zheng, Matthew, et al.
Published: (2024)
A Review on Generative AI For Text-To-Image and Image-To-Image Generation and Implications To Scientific Images
by: Sordo, Zineb, et al.
Published: (2025)
DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior
by: Zhang, Yiming, et al.
Published: (2024)
Imagine360: Immersive 360 Video Generation from Perspective Anchor
by: Tan, Jing, et al.
Published: (2024)
Exploring Motion-Language Alignment for Text-driven Motion Generation
by: Gu, Ruxi, et al.
Published: (2026)
Generating Intermediate Representations for Compositional Text-To-Image Generation
by: Galun, Ran, et al.
Published: (2024)
Exploring the Naturalness of AI-Generated Images
by: Chen, Zijian, et al.
Published: (2023)
Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation
by: Zhou, Yucheng, et al.
Published: (2025)
Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models
by: Samuel, Dvir, et al.
Published: (2023)
TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation
by: Ozaki, Shintaro, et al.
Published: (2025)
Generating Multimodal Images with GAN: Integrating Text, Image, and Style
by: Tan, Chaoyi, et al.
Published: (2025)
Text to Image Generation and Editing: A Survey
by: Yang, Pengfei, et al.
Published: (2025)
Resolving the Identity Crisis in Text-to-Image Generation
by: Borse, Shubhankar, et al.
Published: (2025)
Conditional Text-to-Image Generation with Reference Guidance
by: Kim, Taewook, et al.
Published: (2024)
Text-Image Conditioned 3D Generation
by: Cen, Jiazhong, et al.
Published: (2026)
Personalizing Text-to-Image Generation to Individual Taste
by: Maerten, Anne-Sofie, et al.
Published: (2026)
Rich Human Feedback for Text-to-Image Generation
by: Liang, Youwei, et al.
Published: (2023)