Saved in:
| Main Authors: | Jiang, Jing, Ling, Yiran, Li, Binzhu, Li, Pengxiang, Piao, Junming, Zhang, Yu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.06196 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Pixels, Patterns, but No Poetry: To See The World like Humans
by: Gao, Hongcheng, et al.
Published: (2025)
by: Gao, Hongcheng, et al.
Published: (2025)
Single Image Iterative Subject-driven Generation and Editing
by: Shpitzer, Yair, et al.
Published: (2025)
by: Shpitzer, Yair, et al.
Published: (2025)
Seeing the Poem: Image-Semantic Detection of AI-Generated Modern Chinese Poetry with MLLMs
by: Wang, Shanshan, et al.
Published: (2026)
by: Wang, Shanshan, et al.
Published: (2026)
Iterative Refinement Improves Compositional Image Generation
by: Jaiswal, Shantanu, et al.
Published: (2026)
by: Jaiswal, Shantanu, et al.
Published: (2026)
Iterative Adversarial Attack on Image-guided Story Ending Generation
by: Wang, Youze, et al.
Published: (2023)
by: Wang, Youze, et al.
Published: (2023)
Autoregressive Image Generation with Linear Complexity: A Spatial-Aware Decay Perspective
by: Mao, Yuxin, et al.
Published: (2025)
by: Mao, Yuxin, et al.
Published: (2025)
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image
by: Zhao, Yu, et al.
Published: (2024)
by: Zhao, Yu, et al.
Published: (2024)
DCMM-Transformer: Degree-Corrected Mixed-Membership Attention for Medical Imaging
by: Cheng, Huimin, et al.
Published: (2025)
by: Cheng, Huimin, et al.
Published: (2025)
Regeneration Based Training-free Attribution of Fake Images Generated by Text-to-Image Generative Models
by: Li, Meiling, et al.
Published: (2024)
by: Li, Meiling, et al.
Published: (2024)
ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models
by: Zhou, Qin, et al.
Published: (2025)
by: Zhou, Qin, et al.
Published: (2025)
Loupe: A Generalizable and Adaptive Framework for Image Forgery Detection
by: Jiang, Yuchu, et al.
Published: (2025)
by: Jiang, Yuchu, et al.
Published: (2025)
Knowledge Completes the Vision: A Multimodal Entity-aware Retrieval-Augmented Generation Framework for News Image Captioning
by: You, Xiaoxing, et al.
Published: (2025)
by: You, Xiaoxing, et al.
Published: (2025)
StableI2I: Spotting Unintended Changes in Image-to-Image Transition
by: Li, Jiayang, et al.
Published: (2026)
by: Li, Jiayang, et al.
Published: (2026)
GenShield: Unified Detection and Artifact Correction for AI-Generated Images
by: Xu, Zhipei, et al.
Published: (2026)
by: Xu, Zhipei, et al.
Published: (2026)
A Framework For Image Synthesis Using Supervised Contrastive Learning
by: Liu, Yibin, et al.
Published: (2024)
by: Liu, Yibin, et al.
Published: (2024)
CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
by: Zhang, Ruoxuan, et al.
Published: (2025)
by: Zhang, Ruoxuan, et al.
Published: (2025)
Culture-TRIP: Culturally-Aware Text-to-Image Generation with Iterative Prompt Refinement
by: Jeong, Suchae, et al.
Published: (2025)
by: Jeong, Suchae, et al.
Published: (2025)
Condition-Aware Neural Network for Controlled Image Generation
by: Cai, Han, et al.
Published: (2024)
by: Cai, Han, et al.
Published: (2024)
Self-Corrected Image Generation with Explainable Latent Rewards
by: Luo, Yinyi, et al.
Published: (2026)
by: Luo, Yinyi, et al.
Published: (2026)
Relative-Absolute Fusion: Rethinking Feature Extraction in Image-Based Iterative Method Selection for Solving Sparse Linear Systems
by: Zhang, Kaiqi, et al.
Published: (2025)
by: Zhang, Kaiqi, et al.
Published: (2025)
Multi-Agent Image Restoration
by: Jiang, Xu, et al.
Published: (2025)
by: Jiang, Xu, et al.
Published: (2025)
ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning
by: Zhao, Yiran, et al.
Published: (2026)
by: Zhao, Yiran, et al.
Published: (2026)
Improving Generalization of Medical Image Registration Foundation Model
by: Hu, Jing, et al.
Published: (2025)
by: Hu, Jing, et al.
Published: (2025)
Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis
by: Jiang, Yankai, et al.
Published: (2025)
by: Jiang, Yankai, et al.
Published: (2025)
GEBench: Benchmarking Image Generation Models as GUI Environments
by: Li, Haodong, et al.
Published: (2026)
by: Li, Haodong, et al.
Published: (2026)
REVEAL: Reasoning-Enhanced Forensic Evidence Analysis for Explainable AI-Generated Image Detection
by: Cao, Huangsen, et al.
Published: (2025)
by: Cao, Huangsen, et al.
Published: (2025)
Cross Modality Image Translation In Medical Imaging Using Generative Frameworks
by: Romoli, Giulia, et al.
Published: (2026)
by: Romoli, Giulia, et al.
Published: (2026)
IA-T2I: Internet-Augmented Text-to-Image Generation
by: Li, Chuanhao, et al.
Published: (2025)
by: Li, Chuanhao, et al.
Published: (2025)
Poetry in Pixels: Prompt Tuning for Poem Image Generation via Diffusion Models
by: Jamil, Sofia, et al.
Published: (2025)
by: Jamil, Sofia, et al.
Published: (2025)
Culture-inspired Multi-modal Color Palette Generation and Colorization: A Chinese Youth Subculture Case
by: Li, Yufan, et al.
Published: (2021)
by: Li, Yufan, et al.
Published: (2021)
Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation
by: Cho, Jaemin, et al.
Published: (2023)
by: Cho, Jaemin, et al.
Published: (2023)
RL-I2IT: Image-to-Image Translation with Deep Reinforcement Learning
by: Hu, Jing, et al.
Published: (2023)
by: Hu, Jing, et al.
Published: (2023)
VLM-Guided Iterative Refinement for Surgical Image Segmentation with Foundation Models
by: Lou, Ange, et al.
Published: (2026)
by: Lou, Ange, et al.
Published: (2026)
A High-Quality Dataset and Reliable Evaluation for Interleaved Image-Text Generation
by: Feng, Yukang, et al.
Published: (2025)
by: Feng, Yukang, et al.
Published: (2025)
MS-UMamba: An Improved Vision Mamba Unet for Fetal Abdominal Medical Image Segmentation
by: Xu, Caixu, et al.
Published: (2025)
by: Xu, Caixu, et al.
Published: (2025)
Spatial-Aware Latent Initialization for Controllable Image Generation
by: Sun, Wenqiang, et al.
Published: (2024)
by: Sun, Wenqiang, et al.
Published: (2024)
VModA: An Effective Framework for Adaptive NSFW Image Moderation
by: Bao, Han, et al.
Published: (2025)
by: Bao, Han, et al.
Published: (2025)
Heterogeneous Generative Knowledge Distillation with Masked Image Modeling
by: Wang, Ziming, et al.
Published: (2023)
by: Wang, Ziming, et al.
Published: (2023)
WithAnyone: Towards Controllable and ID Consistent Image Generation
by: Xu, Hengyuan, et al.
Published: (2025)
by: Xu, Hengyuan, et al.
Published: (2025)
Compress3D: a Compressed Latent Space for 3D Generation from a Single Image
by: Zhang, Bowen, et al.
Published: (2024)
by: Zhang, Bowen, et al.
Published: (2024)
Similar Items
-
Pixels, Patterns, but No Poetry: To See The World like Humans
by: Gao, Hongcheng, et al.
Published: (2025) -
Single Image Iterative Subject-driven Generation and Editing
by: Shpitzer, Yair, et al.
Published: (2025) -
Seeing the Poem: Image-Semantic Detection of AI-Generated Modern Chinese Poetry with MLLMs
by: Wang, Shanshan, et al.
Published: (2026) -
Iterative Refinement Improves Compositional Image Generation
by: Jaiswal, Shantanu, et al.
Published: (2026) -
Iterative Adversarial Attack on Image-guided Story Ending Generation
by: Wang, Youze, et al.
Published: (2023)