Saved in:
Bibliographic Details
Main Author: Luo, Weiliang
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.18151
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918515852705792
author Luo, Weiliang
author_facet Luo, Weiliang
contents We present Music102, an advanced model aimed at enhancing chord progression accompaniment through a $D_{12}$-equivariant transformer. Inspired by group theory and symbolic music structures, Music102 leverages musical symmetry--such as transposition and reflection operations--integrating these properties into the transformer architecture. By encoding prior music knowledge, the model maintains equivariance across both melody and chord sequences. The POP909 dataset was employed to train and evaluate Music102, revealing significant improvements over the non-equivariant Music101 prototype Music101 in both weighted loss and exact accuracy metrics, despite using fewer parameters. This work showcases the adaptability of self-attention mechanisms and layer normalization to the discrete musical domain, addressing challenges in computational music analysis. With its stable and flexible neural framework, Music102 sets the stage for further exploration in equivariant music generation and computational composition tools, bridging mathematical theory with practical music performance.
format Preprint
id arxiv_https___arxiv_org_abs_2410_18151
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Music102: An $D_{12}$-equivariant transformer for chord progression accompaniment
Luo, Weiliang
Sound
Machine Learning
Multimedia
Audio and Speech Processing
We present Music102, an advanced model aimed at enhancing chord progression accompaniment through a $D_{12}$-equivariant transformer. Inspired by group theory and symbolic music structures, Music102 leverages musical symmetry--such as transposition and reflection operations--integrating these properties into the transformer architecture. By encoding prior music knowledge, the model maintains equivariance across both melody and chord sequences. The POP909 dataset was employed to train and evaluate Music102, revealing significant improvements over the non-equivariant Music101 prototype Music101 in both weighted loss and exact accuracy metrics, despite using fewer parameters. This work showcases the adaptability of self-attention mechanisms and layer normalization to the discrete musical domain, addressing challenges in computational music analysis. With its stable and flexible neural framework, Music102 sets the stage for further exploration in equivariant music generation and computational composition tools, bridging mathematical theory with practical music performance.
title Music102: An $D_{12}$-equivariant transformer for chord progression accompaniment
topic Sound
Machine Learning
Multimedia
Audio and Speech Processing
url https://arxiv.org/abs/2410.18151