Saved in:
Bibliographic Details
Main Authors: Zhao, Zijian, Jin, Dian, Zhou, Zijing, Zhang, Xiaoyu
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.01482
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915700412514304
author Zhao, Zijian
Jin, Dian
Zhou, Zijing
Zhang, Xiaoyu
author_facet Zhao, Zijian
Jin, Dian
Zhou, Zijing
Zhang, Xiaoyu
contents Stage lighting is a vital component in live music performances, shaping an engaging experience for both musicians and audiences. In recent years, Automatic Stage Lighting Control (ASLC) has attracted growing interest due to the high costs of hiring or training professional lighting engineers. However, most existing ASLC solutions only classify music into limited categories and map them to predefined light patterns, resulting in formulaic and monotonous outcomes that lack rationality. To address this gap, this paper presents Skip-BART, an end-to-end model that directly learns from experienced lighting engineers and predict vivid, human-like stage lighting. To the best of our knowledge, this is the first work to conceptualize ASLC as a generative task rather than merely a classification problem. Our method adapts the BART model to take audio music as input and produce light hue and value (intensity) as output, incorporating a novel skip connection mechanism to enhance the relationship between music and light within the frame grid. To address the lack of available datasets, we create the first stage lighting dataset, along with several pre-training and transfer learning techniques to improve model training with limited data. We validate our method through both quantitative analysis and an human evaluation, demonstrating that Skip-BART outperforms conventional rule-based methods across all evaluation metrics and shows only a limited gap compared to real lighting engineers. To support further research, we have made our self-collected dataset, code, and trained model parameters available at https://github.com/RS2002/Skip-BART .
format Preprint
id arxiv_https___arxiv_org_abs_2506_01482
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Automatic Stage Lighting Control: Is it a Rule-Driven Process or Generative Task?
Zhao, Zijian
Jin, Dian
Zhou, Zijing
Zhang, Xiaoyu
Machine Learning
Artificial Intelligence
Multimedia
Audio and Speech Processing
Stage lighting is a vital component in live music performances, shaping an engaging experience for both musicians and audiences. In recent years, Automatic Stage Lighting Control (ASLC) has attracted growing interest due to the high costs of hiring or training professional lighting engineers. However, most existing ASLC solutions only classify music into limited categories and map them to predefined light patterns, resulting in formulaic and monotonous outcomes that lack rationality. To address this gap, this paper presents Skip-BART, an end-to-end model that directly learns from experienced lighting engineers and predict vivid, human-like stage lighting. To the best of our knowledge, this is the first work to conceptualize ASLC as a generative task rather than merely a classification problem. Our method adapts the BART model to take audio music as input and produce light hue and value (intensity) as output, incorporating a novel skip connection mechanism to enhance the relationship between music and light within the frame grid. To address the lack of available datasets, we create the first stage lighting dataset, along with several pre-training and transfer learning techniques to improve model training with limited data. We validate our method through both quantitative analysis and an human evaluation, demonstrating that Skip-BART outperforms conventional rule-based methods across all evaluation metrics and shows only a limited gap compared to real lighting engineers. To support further research, we have made our self-collected dataset, code, and trained model parameters available at https://github.com/RS2002/Skip-BART .
title Automatic Stage Lighting Control: Is it a Rule-Driven Process or Generative Task?
topic Machine Learning
Artificial Intelligence
Multimedia
Audio and Speech Processing
url https://arxiv.org/abs/2506.01482