Saved in:
| Main Authors: | Zhou, Jun, Li, Jiahao, Xu, Zunnan, Li, Hanhui, Cheng, Yiji, Hong, Fa-Ting, Lin, Qin, Lu, Qinglin, Liang, Xiaodan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.19839 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
by: Hong, Fa-Ting, et al.
Published: (2025)
by: Hong, Fa-Ting, et al.
Published: (2025)
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
by: Zhao, Haozhe, et al.
Published: (2024)
by: Zhao, Haozhe, et al.
Published: (2024)
3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
by: Huang, Xuan, et al.
Published: (2024)
by: Huang, Xuan, et al.
Published: (2024)
Generative Visual Chain-of-Thought for Image Editing
by: Yin, Zijin, et al.
Published: (2026)
by: Yin, Zijin, et al.
Published: (2026)
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
by: Zarei, Arman, et al.
Published: (2025)
by: Zarei, Arman, et al.
Published: (2025)
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
by: Huang, Xuan, et al.
Published: (2024)
by: Huang, Xuan, et al.
Published: (2024)
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
by: Yang, Ling, et al.
Published: (2024)
by: Yang, Ling, et al.
Published: (2024)
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
by: Xiao, Yicheng, et al.
Published: (2026)
by: Xiao, Yicheng, et al.
Published: (2026)
Meta-CoT: Enhancing Granularity and Generalization in Image Editing
by: Zhang, Shiyi, et al.
Published: (2026)
by: Zhang, Shiyi, et al.
Published: (2026)
SuperEdit: Rectifying and Facilitating Supervision for Instruction-Based Image Editing
by: Li, Ming, et al.
Published: (2025)
by: Li, Ming, et al.
Published: (2025)
HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
by: Xu, Zunnan, et al.
Published: (2025)
by: Xu, Zunnan, et al.
Published: (2025)
FREE-Edit: Using Editing-aware Injection in Rectified Flow Models for Zero-shot Image-Driven Video Editing
by: Li, Maomao, et al.
Published: (2026)
by: Li, Maomao, et al.
Published: (2026)
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning
by: He, Qingdong, et al.
Published: (2025)
by: He, Qingdong, et al.
Published: (2025)
MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks
by: Li, Mingsong, et al.
Published: (2025)
by: Li, Mingsong, et al.
Published: (2025)
SpotEdit: Selective Region Editing in Diffusion Transformers
by: Qin, Zhibin, et al.
Published: (2025)
by: Qin, Zhibin, et al.
Published: (2025)
InsightEdit: Towards Better Instruction Following for Image Editing
by: Xu, Yingjing, et al.
Published: (2024)
by: Xu, Yingjing, et al.
Published: (2024)
Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing
by: Yeh, Chun-Hsiao, et al.
Published: (2025)
by: Yeh, Chun-Hsiao, et al.
Published: (2025)
Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing
by: Wang, Hanhui, et al.
Published: (2024)
by: Wang, Hanhui, et al.
Published: (2024)
InstructEdit: Instruction-based Knowledge Editing for Large Language Models
by: Zhang, Ningyu, et al.
Published: (2024)
by: Zhang, Ningyu, et al.
Published: (2024)
An LLM-LVLM Driven Agent for Iterative and Fine-Grained Image Editing
by: Liang, Zihan, et al.
Published: (2025)
by: Liang, Zihan, et al.
Published: (2025)
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
by: Cheng, Junhao, et al.
Published: (2024)
by: Cheng, Junhao, et al.
Published: (2024)
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing
by: Ge, Yuying, et al.
Published: (2024)
by: Ge, Yuying, et al.
Published: (2024)
CogniEdit: Dense Gradient Flow Optimization for Fine-Grained Image Editing
by: Li, Yan, et al.
Published: (2025)
by: Li, Yan, et al.
Published: (2025)
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing
by: Bai, Jinbin, et al.
Published: (2024)
by: Bai, Jinbin, et al.
Published: (2024)
HyperEdit: Unlocking Instruction-based Text Editing in LLMs via Hypernetworks
by: Zeng, Yiming, et al.
Published: (2025)
by: Zeng, Yiming, et al.
Published: (2025)
LocateEdit-Bench: A Benchmark for Instruction-Based Editing Localization
by: Wu, Shiyu, et al.
Published: (2026)
by: Wu, Shiyu, et al.
Published: (2026)
FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing
by: Wu, Junyi, et al.
Published: (2025)
by: Wu, Junyi, et al.
Published: (2025)
Re-Align: Structured Reasoning-guided Alignment for In-Context Image Generation and Editing
by: He, Runze, et al.
Published: (2026)
by: He, Runze, et al.
Published: (2026)
SmartFreeEdit: Mask-Free Spatial-Aware Image Editing with Complex Instruction Understanding
by: Sun, Qianqian, et al.
Published: (2025)
by: Sun, Qianqian, et al.
Published: (2025)
InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation
by: Lin, Yukang, et al.
Published: (2025)
by: Lin, Yukang, et al.
Published: (2025)
Robust Fusion Controller: Degradation-aware Image Fusion with Fine-grained Language Instructions
by: Zhang, Hao, et al.
Published: (2025)
by: Zhang, Hao, et al.
Published: (2025)
Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions
by: Ma, Chenrui, et al.
Published: (2025)
by: Ma, Chenrui, et al.
Published: (2025)
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving
by: Huang, Jiehui, et al.
Published: (2024)
by: Huang, Jiehui, et al.
Published: (2024)
Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance
by: Lin, Yiqi, et al.
Published: (2026)
by: Lin, Yiqi, et al.
Published: (2026)
Edit As You Wish: Video Caption Editing with Multi-grained User Control
by: Yao, Linli, et al.
Published: (2023)
by: Yao, Linli, et al.
Published: (2023)
Region-Constraint In-Context Generation for Instructional Video Editing
by: Zhang, Zhongwei, et al.
Published: (2025)
by: Zhang, Zhongwei, et al.
Published: (2025)
StructDiff: Structure-aware Diffusion Model for 3D Fine-grained Medical Image Synthesis
by: Xia, Jiahao, et al.
Published: (2025)
by: Xia, Jiahao, et al.
Published: (2025)
GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections
by: Zhang, Shiyue, et al.
Published: (2024)
by: Zhang, Shiyue, et al.
Published: (2024)
PhysEdit: Physically-Consistent Region-Aware Image Editing via Adaptive Spatio-Temporal Reasoning
by: Li, Guandong, et al.
Published: (2026)
by: Li, Guandong, et al.
Published: (2026)
Visual Autoregressive Modeling for Instruction-Guided Image Editing
by: Mao, Qingyang, et al.
Published: (2025)
by: Mao, Qingyang, et al.
Published: (2025)
Similar Items
-
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation
by: Hong, Fa-Ting, et al.
Published: (2025) -
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
by: Zhao, Haozhe, et al.
Published: (2024) -
3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands
by: Huang, Xuan, et al.
Published: (2024) -
Generative Visual Chain-of-Thought for Image Editing
by: Yin, Zijin, et al.
Published: (2026) -
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
by: Zarei, Arman, et al.
Published: (2025)