Saved in:
Bibliographic Details
Main Authors: Zhenyuan, Chen, Zechuan, Zhang, Feng, Zhang
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.13708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909053643390976
author Zhenyuan, Chen
Zechuan, Zhang
Feng, Zhang
author_facet Zhenyuan, Chen
Zechuan, Zhang
Feng, Zhang
contents In this paper, we explore text-guided image editing in the remote sensing domain using generative modeling. We propose \rsedit, a collection of models from U-Net to DiT with various configurations. Specifically, we present the first comprehensive study of conditioning strategies for building image editing models from off-the-shelf text-to-image ones. Our experiments show that \rsedit achieves the best instruction-faithful edits while preserving geospatial structure. We release the code at \url{https://github.com/Bili-Sakura/RSEdit-Preview} and checkpoints at \url{https://huggingface.co/collections/BiliSakura/rsedit}.
format Preprint
id arxiv_https___arxiv_org_abs_2603_13708
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle RSEdit: Text-Guided Image Editing for Remote Sensing
Zhenyuan, Chen
Zechuan, Zhang
Feng, Zhang
Computer Vision and Pattern Recognition
In this paper, we explore text-guided image editing in the remote sensing domain using generative modeling. We propose \rsedit, a collection of models from U-Net to DiT with various configurations. Specifically, we present the first comprehensive study of conditioning strategies for building image editing models from off-the-shelf text-to-image ones. Our experiments show that \rsedit achieves the best instruction-faithful edits while preserving geospatial structure. We release the code at \url{https://github.com/Bili-Sakura/RSEdit-Preview} and checkpoints at \url{https://huggingface.co/collections/BiliSakura/rsedit}.
title RSEdit: Text-Guided Image Editing for Remote Sensing
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2603.13708