:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chan, Wing, Allen, Richard
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.00105
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes
by: Ho, Shing-Hei, et al.
Published: (2024)

EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
by: Feng, Wenfeng, et al.
Published: (2025)

Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
by: Yang, Yujia, et al.
Published: (2026)

MedEBench: Diagnosing Reliability in Text-Guided Medical Image Editing
by: Liu, Minghao, et al.
Published: (2025)

On the Fairness, Diversity and Reliability of Text-to-Image Generative Models
by: Vice, Jordan, et al.
Published: (2024)

InEdit-Bench: Benchmarking Intermediate Logical Pathways for Intelligent Image Editing Models
by: Sheng, Zhiqiang, et al.
Published: (2026)

When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
by: Hou, Jiacheng, et al.
Published: (2026)

I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
by: Ma, Yiwei, et al.
Published: (2024)

MULTITEXTEDIT: Benchmarking Cross-Lingual Degradation in Text-in-Image Editing
by: Cheng, Liwei, et al.
Published: (2026)

MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
by: Wan, Yixin, et al.
Published: (2025)

Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance
by: Kim, Kwanyoung
Published: (2025)

WorldEdit: Towards Open-World Image Editing with a Knowledge-Informed Benchmark
by: Lin, Wang, et al.
Published: (2026)

Probing Visual Planning in Image Editing Models
by: Zhou, Zhimu, et al.
Published: (2026)

SINE: SINgle Image Editing with Text-to-Image Diffusion Models
by: Zhang, Zhixing, et al.
Published: (2022)

Concept Lancet: Image Editing with Compositional Representation Transplant
by: Luo, Jinqi, et al.
Published: (2025)

IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment
by: Sun, Shangkun, et al.
Published: (2025)

HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
by: Kim, Wonjae, et al.
Published: (2024)

$\texttt{Complex-Edit}$: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
by: Yang, Siwei, et al.
Published: (2025)

DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model
by: Hong, Shibo, et al.
Published: (2026)

Lazy Diffusion Transformer for Interactive Image Editing
by: Nitzan, Yotam, et al.
Published: (2024)

An Interpretable Local Editing Model for Counterfactual Medical Image Generation
by: Min, Hyungi, et al.
Published: (2026)

Unsupervised Region-Based Image Editing of Denoising Diffusion Models
by: Li, Zixiang, et al.
Published: (2024)

SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation
by: Chen, Siqi, et al.
Published: (2025)

PPTArena: A Benchmark for Agentic PowerPoint Editing
by: Ofengenden, Michael, et al.
Published: (2025)

EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
by: Li, Runjia, et al.
Published: (2025)

UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs
by: Jiang, Lifan, et al.
Published: (2026)

EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation
by: Han, Shuhao, et al.
Published: (2024)

Self-Attention Diffusion Models for Zero-Shot Biomedical Image Segmentation: Unlocking New Frontiers in Medical Imaging
by: Hamrani, Abderrachid, et al.
Published: (2025)

IE-Critic-R1: Advancing the Explanatory Measurement of Text-Driven Image Editing for Human Perception Alignment
by: Qu, Bowen, et al.
Published: (2025)

Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
by: Lin, Fenfen, et al.
Published: (2025)

Inline Critic Steers Image Editing
by: Kang, Weitai, et al.
Published: (2026)

Training-Free Text-Guided Image Editing with Visual Autoregressive Model
by: Wang, Yufei, et al.
Published: (2025)

DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
by: Wang, Xiaolong, et al.
Published: (2024)

Measuring the Measurers: Quality Evaluation of Hallucination Benchmarks for Large Vision-Language Models
by: Yan, Bei, et al.
Published: (2024)

VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark
by: Huang, Han, et al.
Published: (2024)

Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark
by: Kim, Junsu, et al.
Published: (2025)

ChartM$^3$: Benchmarking Chart Editing with Multimodal Instructions
by: Yang, Donglu, et al.
Published: (2025)

Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
by: Dunlop, Connor, et al.
Published: (2025)

T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
by: Huang, Ziwei, et al.
Published: (2024)

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)