Saved in:
| Main Authors: | Chan, Wing, Allen, Richard |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00105 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes
by: Ho, Shing-Hei, et al.
Published: (2024)
by: Ho, Shing-Hei, et al.
Published: (2024)
EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
by: Feng, Wenfeng, et al.
Published: (2025)
by: Feng, Wenfeng, et al.
Published: (2025)
Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
by: Yang, Yujia, et al.
Published: (2026)
by: Yang, Yujia, et al.
Published: (2026)
MedEBench: Diagnosing Reliability in Text-Guided Medical Image Editing
by: Liu, Minghao, et al.
Published: (2025)
by: Liu, Minghao, et al.
Published: (2025)
On the Fairness, Diversity and Reliability of Text-to-Image Generative Models
by: Vice, Jordan, et al.
Published: (2024)
by: Vice, Jordan, et al.
Published: (2024)
InEdit-Bench: Benchmarking Intermediate Logical Pathways for Intelligent Image Editing Models
by: Sheng, Zhiqiang, et al.
Published: (2026)
by: Sheng, Zhiqiang, et al.
Published: (2026)
When the Prompt Becomes Visual: Vision-Centric Jailbreak Attacks for Large Image Editing Models
by: Hou, Jiacheng, et al.
Published: (2026)
by: Hou, Jiacheng, et al.
Published: (2026)
I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing
by: Ma, Yiwei, et al.
Published: (2024)
by: Ma, Yiwei, et al.
Published: (2024)
MULTITEXTEDIT: Benchmarking Cross-Lingual Degradation in Text-in-Image Editing
by: Cheng, Liwei, et al.
Published: (2026)
by: Cheng, Liwei, et al.
Published: (2026)
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
by: Wan, Yixin, et al.
Published: (2025)
by: Wan, Yixin, et al.
Published: (2025)
Toward the Frontiers of Reliable Diffusion Sampling via Adversarial Sinkhorn Attention Guidance
by: Kim, Kwanyoung
Published: (2025)
by: Kim, Kwanyoung
Published: (2025)
WorldEdit: Towards Open-World Image Editing with a Knowledge-Informed Benchmark
by: Lin, Wang, et al.
Published: (2026)
by: Lin, Wang, et al.
Published: (2026)
Probing Visual Planning in Image Editing Models
by: Zhou, Zhimu, et al.
Published: (2026)
by: Zhou, Zhimu, et al.
Published: (2026)
SINE: SINgle Image Editing with Text-to-Image Diffusion Models
by: Zhang, Zhixing, et al.
Published: (2022)
by: Zhang, Zhixing, et al.
Published: (2022)
Concept Lancet: Image Editing with Compositional Representation Transplant
by: Luo, Jinqi, et al.
Published: (2025)
by: Luo, Jinqi, et al.
Published: (2025)
IE-Bench: Advancing the Measurement of Text-Driven Image Editing for Human Perception Alignment
by: Sun, Shangkun, et al.
Published: (2025)
by: Sun, Shangkun, et al.
Published: (2025)
HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
by: Kim, Wonjae, et al.
Published: (2024)
by: Kim, Wonjae, et al.
Published: (2024)
$\texttt{Complex-Edit}$: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
by: Yang, Siwei, et al.
Published: (2025)
by: Yang, Siwei, et al.
Published: (2025)
DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model
by: Hong, Shibo, et al.
Published: (2026)
by: Hong, Shibo, et al.
Published: (2026)
Lazy Diffusion Transformer for Interactive Image Editing
by: Nitzan, Yotam, et al.
Published: (2024)
by: Nitzan, Yotam, et al.
Published: (2024)
An Interpretable Local Editing Model for Counterfactual Medical Image Generation
by: Min, Hyungi, et al.
Published: (2026)
by: Min, Hyungi, et al.
Published: (2026)
Unsupervised Region-Based Image Editing of Denoising Diffusion Models
by: Li, Zixiang, et al.
Published: (2024)
by: Li, Zixiang, et al.
Published: (2024)
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation
by: Chen, Siqi, et al.
Published: (2025)
by: Chen, Siqi, et al.
Published: (2025)
PPTArena: A Benchmark for Agentic PowerPoint Editing
by: Ofengenden, Michael, et al.
Published: (2025)
by: Ofengenden, Michael, et al.
Published: (2025)
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing
by: Li, Runjia, et al.
Published: (2025)
by: Li, Runjia, et al.
Published: (2025)
UniEditBench: A Unified and Cost-Effective Benchmark for Image and Video Editing via Distilled MLLMs
by: Jiang, Lifan, et al.
Published: (2026)
by: Jiang, Lifan, et al.
Published: (2026)
EvalMuse-40K: A Reliable and Fine-Grained Benchmark with Comprehensive Human Annotations for Text-to-Image Generation Model Evaluation
by: Han, Shuhao, et al.
Published: (2024)
by: Han, Shuhao, et al.
Published: (2024)
Self-Attention Diffusion Models for Zero-Shot Biomedical Image Segmentation: Unlocking New Frontiers in Medical Imaging
by: Hamrani, Abderrachid, et al.
Published: (2025)
by: Hamrani, Abderrachid, et al.
Published: (2025)
IE-Critic-R1: Advancing the Explanatory Measurement of Text-Driven Image Editing for Human Perception Alignment
by: Qu, Bowen, et al.
Published: (2025)
by: Qu, Bowen, et al.
Published: (2025)
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
by: Lin, Fenfen, et al.
Published: (2025)
by: Lin, Fenfen, et al.
Published: (2025)
Inline Critic Steers Image Editing
by: Kang, Weitai, et al.
Published: (2026)
by: Kang, Weitai, et al.
Published: (2026)
Training-Free Text-Guided Image Editing with Visual Autoregressive Model
by: Wang, Yufei, et al.
Published: (2025)
by: Wang, Yufei, et al.
Published: (2025)
DPDEdit: Detail-Preserved Diffusion Models for Multimodal Fashion Image Editing
by: Wang, Xiaolong, et al.
Published: (2024)
by: Wang, Xiaolong, et al.
Published: (2024)
Measuring the Measurers: Quality Evaluation of Hallucination Benchmarks for Large Vision-Language Models
by: Yan, Bei, et al.
Published: (2024)
by: Yan, Bei, et al.
Published: (2024)
VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark
by: Huang, Han, et al.
Published: (2024)
by: Huang, Han, et al.
Published: (2024)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark
by: Kim, Junsu, et al.
Published: (2025)
by: Kim, Junsu, et al.
Published: (2025)
ChartM$^3$: Benchmarking Chart Editing with Multimodal Instructions
by: Yang, Donglu, et al.
Published: (2025)
by: Yang, Donglu, et al.
Published: (2025)
Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization
by: Dunlop, Connor, et al.
Published: (2025)
by: Dunlop, Connor, et al.
Published: (2025)
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
by: Huang, Ziwei, et al.
Published: (2024)
by: Huang, Ziwei, et al.
Published: (2024)
DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)
by: Wang, Dianyi, et al.
Published: (2026)
Similar Items
-
LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes
by: Ho, Shing-Hei, et al.
Published: (2024) -
EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
by: Feng, Wenfeng, et al.
Published: (2025) -
Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
by: Yang, Yujia, et al.
Published: (2026) -
MedEBench: Diagnosing Reliability in Text-Guided Medical Image Editing
by: Liu, Minghao, et al.
Published: (2025) -
On the Fairness, Diversity and Reliability of Text-to-Image Generative Models
by: Vice, Jordan, et al.
Published: (2024)