Guardado en:
| Autores principales: | Li, Yan, Liu, Lin, Zhang, Xiaopeng, Xue, Wei, Luo, Wenhan, Guo, Yike, Tian, Qi |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.13276 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
FineEdit: Fine-Grained Image Edit with Bounding Box Guidance
por: Xu, Haohang, et al.
Publicado: (2026)
por: Xu, Haohang, et al.
Publicado: (2026)
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
por: Xiao, Yicheng, et al.
Publicado: (2026)
por: Xiao, Yicheng, et al.
Publicado: (2026)
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
por: Zarei, Arman, et al.
Publicado: (2025)
por: Zarei, Arman, et al.
Publicado: (2025)
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
por: Zhao, Haozhe, et al.
Publicado: (2024)
por: Zhao, Haozhe, et al.
Publicado: (2024)
FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions
por: Zhao, Peisen, et al.
Publicado: (2026)
por: Zhao, Peisen, et al.
Publicado: (2026)
Reasoning to Align: Implicit Reasoning in Diffusion Transformers for Video Editing
por: Li, Yan, et al.
Publicado: (2026)
por: Li, Yan, et al.
Publicado: (2026)
PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models
por: Cvejic, Aleksandar, et al.
Publicado: (2025)
por: Cvejic, Aleksandar, et al.
Publicado: (2025)
SpotEdit: Evaluating Visually-Guided Image Editing Methods
por: Ghazanfari, Sara, et al.
Publicado: (2025)
por: Ghazanfari, Sara, et al.
Publicado: (2025)
O-DisCo-Edit: Object Distortion Control for Unified Realistic Video Editing
por: Chen, Yuqing, et al.
Publicado: (2025)
por: Chen, Yuqing, et al.
Publicado: (2025)
CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation
por: Li, Peng, et al.
Publicado: (2025)
por: Li, Peng, et al.
Publicado: (2025)
SINGER: Vivid Audio-driven Singing Video Generation with Multi-scale Spectral Diffusion Model
por: Li, Yan, et al.
Publicado: (2024)
por: Li, Yan, et al.
Publicado: (2024)
ComprehendEdit: A Comprehensive Dataset and Evaluation Framework for Multimodal Knowledge Editing
por: Ma, Yaohui, et al.
Publicado: (2024)
por: Ma, Yaohui, et al.
Publicado: (2024)
Edit2Perceive: Image Editing Diffusion Models Are Strong Dense Perceivers
por: Shi, Yiqing, et al.
Publicado: (2025)
por: Shi, Yiqing, et al.
Publicado: (2025)
Attention Hijacking: Response Manipulation Across Queries in Vision-Language Models
por: Wang, Zhiqiang, et al.
Publicado: (2026)
por: Wang, Zhiqiang, et al.
Publicado: (2026)
Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation Knowledge
por: Cai, Yiyang, et al.
Publicado: (2024)
por: Cai, Yiyang, et al.
Publicado: (2024)
VFX Creator: Animated Visual Effect Generation with Controllable Diffusion Transformer
por: Liu, Xinyu, et al.
Publicado: (2025)
por: Liu, Xinyu, et al.
Publicado: (2025)
FlexEdit: Marrying Free-Shape Masks to VLLM for Flexible Image Editing
por: Yuan, Tianshuo, et al.
Publicado: (2024)
por: Yuan, Tianshuo, et al.
Publicado: (2024)
AutoEdit: Automatic Hyperparameter Tuning for Image Editing
por: Pham, Chau, et al.
Publicado: (2025)
por: Pham, Chau, et al.
Publicado: (2025)
ImgEdit: A Unified Image Editing Dataset and Benchmark
por: Ye, Yang, et al.
Publicado: (2025)
por: Ye, Yang, et al.
Publicado: (2025)
FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning
por: Chen, Zhi, et al.
Publicado: (2024)
por: Chen, Zhi, et al.
Publicado: (2024)
Co$^{3}$Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion
por: Qi, Xingqun, et al.
Publicado: (2025)
por: Qi, Xingqun, et al.
Publicado: (2025)
DiT4Edit: Diffusion Transformer for Image Editing
por: Feng, Kunyu, et al.
Publicado: (2024)
por: Feng, Kunyu, et al.
Publicado: (2024)
MT-EditFlow: Reinforcement Learning for Multi-Turn Image Editing with Flow Matching
por: Huang, Jiahui, et al.
Publicado: (2026)
por: Huang, Jiahui, et al.
Publicado: (2026)
Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing
por: Lin, Haonan, et al.
Publicado: (2024)
por: Lin, Haonan, et al.
Publicado: (2024)
PostEdit: Posterior Sampling for Efficient Zero-Shot Image Editing
por: Tian, Feng, et al.
Publicado: (2024)
por: Tian, Feng, et al.
Publicado: (2024)
AdaEdit: Adaptive Temporal and Channel Modulation for Flow-Based Image Editing
por: Li, Guandong, et al.
Publicado: (2026)
por: Li, Guandong, et al.
Publicado: (2026)
DirectEdit: Step-Level Accurate Inversion for Flow-Based Image Editing
por: Yang, Desong, et al.
Publicado: (2026)
por: Yang, Desong, et al.
Publicado: (2026)
Edit Transfer: Learning Image Editing via Vision In-Context Relations
por: Chen, Lan, et al.
Publicado: (2025)
por: Chen, Lan, et al.
Publicado: (2025)
InsightEdit: Towards Better Instruction Following for Image Editing
por: Xu, Yingjing, et al.
Publicado: (2024)
por: Xu, Yingjing, et al.
Publicado: (2024)
FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model
por: Zhou, Jun, et al.
Publicado: (2025)
por: Zhou, Jun, et al.
Publicado: (2025)
UniEdit-Flow: Unleashing Inversion and Editing in the Era of Flow Models
por: Jiao, Guanlong, et al.
Publicado: (2025)
por: Jiao, Guanlong, et al.
Publicado: (2025)
InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow
por: Gong, Yiming, et al.
Publicado: (2025)
por: Gong, Yiming, et al.
Publicado: (2025)
EditCLIP: Representation Learning for Image Editing
por: Wang, Qian, et al.
Publicado: (2025)
por: Wang, Qian, et al.
Publicado: (2025)
Can VLMs Detect and Localize Fine-Grained AI-Edited Images?
por: Sun, Zhen, et al.
Publicado: (2025)
por: Sun, Zhen, et al.
Publicado: (2025)
Edit-GRPO: A Locality-Preserving Policy Optimization Framework for Image Editing
por: Xu, Shaodong, et al.
Publicado: (2026)
por: Xu, Shaodong, et al.
Publicado: (2026)
TiNO-Edit: Timestep and Noise Optimization for Robust Diffusion-Based Image Editing
por: Chen, Sherry X., et al.
Publicado: (2024)
por: Chen, Sherry X., et al.
Publicado: (2024)
EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing
por: Chen, Lan, et al.
Publicado: (2026)
por: Chen, Lan, et al.
Publicado: (2026)
ChordEdit: One-Step Low-Energy Transport for Image Editing
por: Lu, Liangsi, et al.
Publicado: (2026)
por: Lu, Liangsi, et al.
Publicado: (2026)
Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning
por: Liu, Xinyu, et al.
Publicado: (2025)
por: Liu, Xinyu, et al.
Publicado: (2025)
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images
por: Hoe, Jiun Tian, et al.
Publicado: (2025)
por: Hoe, Jiun Tian, et al.
Publicado: (2025)
Ejemplares similares
-
FineEdit: Fine-Grained Image Edit with Bounding Box Guidance
por: Xu, Haohang, et al.
Publicado: (2026) -
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
por: Xiao, Yicheng, et al.
Publicado: (2026) -
SliderEdit: Continuous Image Editing with Fine-Grained Instruction Control
por: Zarei, Arman, et al.
Publicado: (2025) -
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
por: Zhao, Haozhe, et al.
Publicado: (2024) -
FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions
por: Zhao, Peisen, et al.
Publicado: (2026)