Saved in:
| Main Authors: | Luo, Yan, Aidara, Ahmadou, Lu, Jingyi, Moebel, Jeremy, Han, Kai, Wang, Mengyu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.15661 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
by: Chen, Pengtao, et al.
Published: (2025)
by: Chen, Pengtao, et al.
Published: (2025)
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing
by: Xie, Weiyan, et al.
Published: (2025)
by: Xie, Weiyan, et al.
Published: (2025)
GeoWorld-VLM: Geometry from World Models for Vision-Language Models
by: Gu, Renjie, et al.
Published: (2026)
by: Gu, Renjie, et al.
Published: (2026)
MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance
by: Bai, Xuehai, et al.
Published: (2026)
by: Bai, Xuehai, et al.
Published: (2026)
When Test-Time Guidance Is Enough: Fast Image and Video Editing with Diffusion Guidance
by: Ghorbel, Ahmed, et al.
Published: (2026)
by: Ghorbel, Ahmed, et al.
Published: (2026)
GEN3D: Generating Domain-Free 3D Scenes from a Single Image
by: Zhang, Yuxin, et al.
Published: (2025)
by: Zhang, Yuxin, et al.
Published: (2025)
Dual-Channel Attention Guidance for Training-Free Image Editing Control in Diffusion Transformers
by: Li, Guandong
Published: (2026)
by: Li, Guandong
Published: (2026)
Inline Critic Steers Image Editing
by: Kang, Weitai, et al.
Published: (2026)
by: Kang, Weitai, et al.
Published: (2026)
Hierarchical Concept-to-Appearance Guidance for Multi-Subject Image Generation
by: Xu, Yijia, et al.
Published: (2026)
by: Xu, Yijia, et al.
Published: (2026)
Improving Diffusion-Based Image Editing Faithfulness via Guidance and Scheduling
by: Cho, Hansam, et al.
Published: (2025)
by: Cho, Hansam, et al.
Published: (2025)
Consistent Video Editing as Flow-Driven Image-to-Video Generation
by: Wang, Ge, et al.
Published: (2025)
by: Wang, Ge, et al.
Published: (2025)
VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image
by: Hsiao, Teng-Fang, et al.
Published: (2026)
by: Hsiao, Teng-Fang, et al.
Published: (2026)
From Scale to Speed: Adaptive Test-Time Scaling for Image Editing
by: Qu, Xiangyan, et al.
Published: (2026)
by: Qu, Xiangyan, et al.
Published: (2026)
Tuning-free Instruction-based Video Editing Via Structural Noise Initialization and Guidance
by: Wu, Song, et al.
Published: (2026)
by: Wu, Song, et al.
Published: (2026)
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
by: Zhang, Zhixing, et al.
Published: (2022)
by: Zhang, Zhixing, et al.
Published: (2022)
Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance
by: Lin, Yiqi, et al.
Published: (2026)
by: Lin, Yiqi, et al.
Published: (2026)
Reflexive Guidance: Improving OoDD in Vision-Language Models via Self-Guided Image-Adaptive Concept Generation
by: Kim, Jihyo, et al.
Published: (2024)
by: Kim, Jihyo, et al.
Published: (2024)
UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)
by: Wang, Dianyi, et al.
Published: (2026)
MMIF-AMIN: Adaptive Loss-Driven Multi-Scale Invertible Dense Network for Multimodal Medical Image Fusion
by: Luo, Tao, et al.
Published: (2025)
by: Luo, Tao, et al.
Published: (2025)
FreeEdit: Mask-free Reference-based Image Editing with Multi-modal Instruction
by: He, Runze, et al.
Published: (2024)
by: He, Runze, et al.
Published: (2024)
Guidance Matters: Rethinking the Evaluation Pitfall for Text-to-Image Generation
by: Xie, Dian, et al.
Published: (2026)
by: Xie, Dian, et al.
Published: (2026)
Image-POSER: Reflective RL for Multi-Expert Image Generation and Editing
by: Mohebbi, Hossein, et al.
Published: (2025)
by: Mohebbi, Hossein, et al.
Published: (2025)
Single Image Iterative Subject-driven Generation and Editing
by: Shpitzer, Yair, et al.
Published: (2025)
by: Shpitzer, Yair, et al.
Published: (2025)
Nexus-Gen: Unified Image Understanding, Generation, and Editing via Prefilled Autoregression in Shared Embedding Space
by: Zhang, Hong, et al.
Published: (2025)
by: Zhang, Hong, et al.
Published: (2025)
Normalization-Equivariant Neural Networks with Application to Image Denoising
by: Herbreteau, Sébastien, et al.
Published: (2023)
by: Herbreteau, Sébastien, et al.
Published: (2023)
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)
by: Wang, Dianyi, et al.
Published: (2026)
Image-Goal Navigation Using Refined Feature Guidance and Scene Graph Enhancement
by: Feng, Zhicheng, et al.
Published: (2025)
by: Feng, Zhicheng, et al.
Published: (2025)
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing
by: Chen, Zhihong, et al.
Published: (2025)
by: Chen, Zhihong, et al.
Published: (2025)
OneActor: Consistent Character Generation via Cluster-Conditioned Guidance
by: Wang, Jiahao, et al.
Published: (2024)
by: Wang, Jiahao, et al.
Published: (2024)
RegionDrag: Fast Region-Based Image Editing with Diffusion Models
by: Lu, Jingyi, et al.
Published: (2024)
by: Lu, Jingyi, et al.
Published: (2024)
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance
by: Luo, Yuxuan, et al.
Published: (2025)
by: Luo, Yuxuan, et al.
Published: (2025)
DescriptorMedSAM: Language-Image Fusion with Multi-Aspect Text Guidance for Medical Image Segmentation
by: Zhang, Wenjie, et al.
Published: (2025)
by: Zhang, Wenjie, et al.
Published: (2025)
Visual Generation Without Guidance
by: Chen, Huayu, et al.
Published: (2025)
by: Chen, Huayu, et al.
Published: (2025)
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation
by: Chen, Siqi, et al.
Published: (2025)
by: Chen, Siqi, et al.
Published: (2025)
Understanding Generative AI Capabilities in Everyday Image Editing Tasks
by: Taesiri, Mohammad Reza, et al.
Published: (2025)
by: Taesiri, Mohammad Reza, et al.
Published: (2025)
An Interpretable Local Editing Model for Counterfactual Medical Image Generation
by: Min, Hyungi, et al.
Published: (2026)
by: Min, Hyungi, et al.
Published: (2026)
Scale-Aware Relay and Scale-Adaptive Loss for Tiny Object Detection in Aerial Images
by: Li, Jinfu, et al.
Published: (2025)
by: Li, Jinfu, et al.
Published: (2025)
MieDB-100k: A Comprehensive Dataset for Medical Image Editing
by: Lai, Yongfan, et al.
Published: (2026)
by: Lai, Yongfan, et al.
Published: (2026)
BrainDreamer: Reasoning-Coherent and Controllable Image Generation from EEG Brain Signals via Language Guidance
by: Wang, Ling, et al.
Published: (2024)
by: Wang, Ling, et al.
Published: (2024)
FODA-PG for Enhanced Medical Imaging Narrative Generation: Adaptive Differentiation of Normal and Abnormal Attributes
by: Shu, Kai, et al.
Published: (2024)
by: Shu, Kai, et al.
Published: (2024)
Similar Items
-
RegionE: Adaptive Region-Aware Generation for Efficient Image Editing
by: Chen, Pengtao, et al.
Published: (2025) -
CannyEdit: Selective Canny Control and Dual-Prompt Guidance for Training-Free Image Editing
by: Xie, Weiyan, et al.
Published: (2025) -
GeoWorld-VLM: Geometry from World Models for Vision-Language Models
by: Gu, Renjie, et al.
Published: (2026) -
MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance
by: Bai, Xuehai, et al.
Published: (2026) -
When Test-Time Guidance Is Enough: Fast Image and Video Editing with Diffusion Guidance
by: Ghorbel, Ahmed, et al.
Published: (2026)