Saved in:
Bibliographic Details
Main Authors: Liu, Yong, Xiao, Wenpeng, Wang, Qianqian, Chen, Junlin, Wang, Shiyin, Wang, Yitong, Wu, Xinglong, Tang, Yansong
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.14549
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908410780319744
author Liu, Yong
Xiao, Wenpeng
Wang, Qianqian
Chen, Junlin
Wang, Shiyin
Wang, Yitong
Wu, Xinglong
Tang, Yansong
author_facet Liu, Yong
Xiao, Wenpeng
Wang, Qianqian
Chen, Junlin
Wang, Shiyin
Wang, Yitong
Wu, Xinglong
Tang, Yansong
contents We introduce a model named DreamLight for universal image relighting in this work, which can seamlessly composite subjects into a new background while maintaining aesthetic uniformity in terms of lighting and color tone. The background can be specified by natural images (image-based relighting) or generated from unlimited text prompts (text-based relighting). Existing studies primarily focus on image-based relighting, while with scant exploration into text-based scenarios. Some works employ intricate disentanglement pipeline designs relying on environment maps to provide relevant information, which grapples with the expensive data cost required for intrinsic decomposition and light source. Other methods take this task as an image translation problem and perform pixel-level transformation with autoencoder architecture. While these methods have achieved decent harmonization effects, they struggle to generate realistic and natural light interaction effects between the foreground and background. To alleviate these challenges, we reorganize the input data into a unified format and leverage the semantic prior provided by the pretrained diffusion model to facilitate the generation of natural results. Moreover, we propose a Position-Guided Light Adapter (PGLA) that condenses light information from different directions in the background into designed light query embeddings, and modulates the foreground with direction-biased masked attention. In addition, we present a post-processing module named Spectral Foreground Fixer (SFF) to adaptively reorganize different frequency components of subject and relighted background, which helps enhance the consistency of foreground appearance. Extensive comparisons and user study demonstrate that our DreamLight achieves remarkable relighting performance.
format Preprint
id arxiv_https___arxiv_org_abs_2506_14549
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle DreamLight: Towards Harmonious and Consistent Image Relighting
Liu, Yong
Xiao, Wenpeng
Wang, Qianqian
Chen, Junlin
Wang, Shiyin
Wang, Yitong
Wu, Xinglong
Tang, Yansong
Computer Vision and Pattern Recognition
We introduce a model named DreamLight for universal image relighting in this work, which can seamlessly composite subjects into a new background while maintaining aesthetic uniformity in terms of lighting and color tone. The background can be specified by natural images (image-based relighting) or generated from unlimited text prompts (text-based relighting). Existing studies primarily focus on image-based relighting, while with scant exploration into text-based scenarios. Some works employ intricate disentanglement pipeline designs relying on environment maps to provide relevant information, which grapples with the expensive data cost required for intrinsic decomposition and light source. Other methods take this task as an image translation problem and perform pixel-level transformation with autoencoder architecture. While these methods have achieved decent harmonization effects, they struggle to generate realistic and natural light interaction effects between the foreground and background. To alleviate these challenges, we reorganize the input data into a unified format and leverage the semantic prior provided by the pretrained diffusion model to facilitate the generation of natural results. Moreover, we propose a Position-Guided Light Adapter (PGLA) that condenses light information from different directions in the background into designed light query embeddings, and modulates the foreground with direction-biased masked attention. In addition, we present a post-processing module named Spectral Foreground Fixer (SFF) to adaptively reorganize different frequency components of subject and relighted background, which helps enhance the consistency of foreground appearance. Extensive comparisons and user study demonstrate that our DreamLight achieves remarkable relighting performance.
title DreamLight: Towards Harmonious and Consistent Image Relighting
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2506.14549