:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ma, Shichao, Guo, Yunhe, Su, Jiahao, Huang, Qihe, Zhou, Zhengyang, Wang, Yang
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2508.06916
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

MT-EditFlow: Reinforcement Learning for Multi-Turn Image Editing with Flow Matching
von: Huang, Jiahui, et al.
Veröffentlicht: (2026)

An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing
von: Liang, Zihan, et al.
Veröffentlicht: (2025)

RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation
von: Chen, Peng, et al.
Veröffentlicht: (2026)

FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing
von: Liao, Yucheng, et al.
Veröffentlicht: (2025)

FaceEditTalker: Controllable Talking Head Generation with Facial Attribute Editing
von: Feng, Guanwen, et al.
Veröffentlicht: (2025)

Multi-turn Consistent Image Editing
von: Zhou, Zijun, et al.
Veröffentlicht: (2025)

CREA: A Collaborative Multi-Agent Framework for Creative Image Editing and Generation
von: Venkatesh, Kavana, et al.
Veröffentlicht: (2025)

Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty
von: Hahn, Meera, et al.
Veröffentlicht: (2024)

CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing
von: Gupta, Advait, et al.
Veröffentlicht: (2025)

Towards Generalized Multi-Image Editing for Unified Multimodal Models
von: Xu, Pengcheng, et al.
Veröffentlicht: (2026)

Rethinking Scribble-Guided Image Editing: Generalization, Instruction Adherence, and Multi-Tasking
von: Xu, Mingyi, et al.
Veröffentlicht: (2026)

Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions
von: Ma, Chenrui, et al.
Veröffentlicht: (2025)

Text to Image Generation and Editing: A Survey
von: Yang, Pengfei, et al.
Veröffentlicht: (2025)

ParallelEdits: Efficient Multi-object Image Editing
von: Huang, Mingzhen, et al.
Veröffentlicht: (2024)

UniRef-Image-Edit: Towards Scalable and Consistent Multi-Reference Image Editing
von: Wei, Hongyang, et al.
Veröffentlicht: (2026)

MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis
von: Zhou, Dewei, et al.
Veröffentlicht: (2024)

SOEDiff: Efficient Distillation for Small Object Editing
von: Wu, Yiming, et al.
Veröffentlicht: (2024)

SeedEdit: Align Image Re-Generation to Image Editing
von: Shi, Yichun, et al.
Veröffentlicht: (2024)

MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models
von: Zhu, Hongyang, et al.
Veröffentlicht: (2025)

ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework
von: Chen, Guanzhou, et al.
Veröffentlicht: (2026)

MIFO: Learning and Synthesizing Multi-Instance from One Image
von: Su, Kailun, et al.
Veröffentlicht: (2025)

Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios
von: Tang, Mingwei, et al.
Veröffentlicht: (2025)

FaSTA$^*$: Fast-Slow Toolpath Agent with Subroutine Mining for Efficient Multi-turn Image Editing
von: Gupta, Advait, et al.
Veröffentlicht: (2025)

M3: High-fidelity Text-to-Image Generation via Multi-Modal, Multi-Agent and Multi-Round Visual Reasoning
von: Yang, Bangji, et al.
Veröffentlicht: (2026)

MSRAMIE: Multimodal Structured Reasoning Agent for Multi-instruction Image Editing
von: Qiu, Zhaoyuan, et al.
Veröffentlicht: (2026)

CAMEO: A Conditional and Quality-Aware Multi-Agent Image Editing Orchestrator
von: Pu, Yuhan, et al.
Veröffentlicht: (2026)

CCA: Collaborative Competitive Agents for Image Editing
von: Hang, Tiankai, et al.
Veröffentlicht: (2024)

MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis
von: Zhou, Dewei, et al.
Veröffentlicht: (2024)

Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
von: Mohebbi, Hossein, et al.
Veröffentlicht: (2025)

Training-Free Multi-Concept Image Editing
von: Foteinopoulou, Niki, et al.
Veröffentlicht: (2026)

3D-aware Image Generation and Editing with Multi-modal Conditions
von: Li, Bo, et al.
Veröffentlicht: (2024)

MV-Adapter: Multi-view Consistent Image Generation Made Easy
von: Huang, Zehuan, et al.
Veröffentlicht: (2024)

IMAGAgent: Orchestrating Multi-Turn Image Editing via Constraint-Aware Planning and Reflection
von: Shen, Fei, et al.
Veröffentlicht: (2026)

Draw ALL Your Imagine: A Holistic Benchmark and Agent Framework for Complex Instruction-based Image Generation
von: Zhou, Yucheng, et al.
Veröffentlicht: (2025)

SciLT: Long-tailed Image Classification under Scientific Image Domains
von: Chen, Jiahao, et al.
Veröffentlicht: (2026)

Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings
von: Du, Xusheng, et al.
Veröffentlicht: (2025)

Improving the Classification Effect of Clinical Images of Diseases for Multi-Source Privacy Protection
von: Bowen, Tian, et al.
Veröffentlicht: (2024)

MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing
von: Yang, Chuang, et al.
Veröffentlicht: (2024)

Instruction Guided Multi Object Image Editing with Quantity and Layout Consistency
von: Tan, Jiaqi, et al.
Veröffentlicht: (2025)

3D-VirtFusion: Synthetic 3D Data Augmentation through Generative Diffusion Models and Controllable Editing
von: Dong, Shichao, et al.
Veröffentlicht: (2024)