Bibliographic Details
Main Authors: Zhenyuan Chen, Zechuan Zhang, Feng Zhang
Format: Preprint
Published: 2026
Online Access: https://arxiv.org/abs/2603.13708
Table of Contents:
  • In this paper, we explore text-guided image editing in the remote sensing domain using generative modeling. We propose RSEdit, a collection of models spanning U-Net and DiT architectures in various configurations. Specifically, we present the first comprehensive study of conditioning strategies for building image editing models from off-the-shelf text-to-image models. Our experiments show that RSEdit achieves the most instruction-faithful edits while preserving geospatial structure. We release the code at https://github.com/Bili-Sakura/RSEdit-Preview and checkpoints at https://huggingface.co/collections/BiliSakura/rsedit.