Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Yuxin, Zheng, Dandan, Gong, Biao, Wang, Shiwen, Chen, Jingdong, Yang, Ming, Dong, Weiming, Xu, Changsheng
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2410.22979
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916912512892928
author	Zhang, Yuxin Zheng, Dandan Gong, Biao Wang, Shiwen Chen, Jingdong Yang, Ming Dong, Weiming Xu, Changsheng
author_facet	Zhang, Yuxin Zheng, Dandan Gong, Biao Wang, Shiwen Chen, Jingdong Yang, Ming Dong, Weiming Xu, Changsheng
contents	Lighting plays a pivotal role in ensuring the naturalness and aesthetic quality of video generation. However, the impact of lighting is deeply coupled with other factors of videos, e.g., objects and scenes. Thus, it remains challenging to disentangle and model coherent lighting conditions independently, limiting the flexibility to control lighting in video generation. In this paper, inspired by the established controllable T2I models, we propose LumiSculpt, which enables precise and consistent lighting control in T2V generation models. LumiSculpt equips the video generation with new interactive capabilities, allowing the input of reference image sequences with customized lighting conditions. Furthermore, the core learnable plug-and-play module of LumiSculpt facilitates direct control over the intensity, position and trajectory of an assumed light source in video diffusion models. To effectively train LumiSculpt and address the issue of insufficient lighting data, we construct LumiHuman, a new lightweight and flexible dataset for portrait lighting of images and videos. Experimental results demonstrate that LumiSculpt achieves precise and high-quality lighting control in video generation. The analysis demonstrates the flexibility of LumiHuman.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_22979
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	LumiSculpt: Enabling Consistent Portrait Lighting in Video Generation Zhang, Yuxin Zheng, Dandan Gong, Biao Wang, Shiwen Chen, Jingdong Yang, Ming Dong, Weiming Xu, Changsheng Computer Vision and Pattern Recognition Lighting plays a pivotal role in ensuring the naturalness and aesthetic quality of video generation. However, the impact of lighting is deeply coupled with other factors of videos, e.g., objects and scenes. Thus, it remains challenging to disentangle and model coherent lighting conditions independently, limiting the flexibility to control lighting in video generation. In this paper, inspired by the established controllable T2I models, we propose LumiSculpt, which enables precise and consistent lighting control in T2V generation models. LumiSculpt equips the video generation with new interactive capabilities, allowing the input of reference image sequences with customized lighting conditions. Furthermore, the core learnable plug-and-play module of LumiSculpt facilitates direct control over the intensity, position and trajectory of an assumed light source in video diffusion models. To effectively train LumiSculpt and address the issue of insufficient lighting data, we construct LumiHuman, a new lightweight and flexible dataset for portrait lighting of images and videos. Experimental results demonstrate that LumiSculpt achieves precise and high-quality lighting control in video generation. The analysis demonstrates the flexibility of LumiHuman.
title	LumiSculpt: Enabling Consistent Portrait Lighting in Video Generation
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2410.22979

Similar Items