Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Luo, Weiliang
Format:	Preprint
Published:	2024
Subjects:	Sound Machine Learning Multimedia Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2410.18151
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918515852705792
author	Luo, Weiliang
author_facet	Luo, Weiliang
contents	We present Music102, an advanced model aimed at enhancing chord progression accompaniment through a $D_{12}$-equivariant transformer. Inspired by group theory and symbolic music structures, Music102 leverages musical symmetry--such as transposition and reflection operations--integrating these properties into the transformer architecture. By encoding prior music knowledge, the model maintains equivariance across both melody and chord sequences. The POP909 dataset was employed to train and evaluate Music102, revealing significant improvements over the non-equivariant Music101 prototype Music101 in both weighted loss and exact accuracy metrics, despite using fewer parameters. This work showcases the adaptability of self-attention mechanisms and layer normalization to the discrete musical domain, addressing challenges in computational music analysis. With its stable and flexible neural framework, Music102 sets the stage for further exploration in equivariant music generation and computational composition tools, bridging mathematical theory with practical music performance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_18151
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Music102: An $D_{12}$-equivariant transformer for chord progression accompaniment Luo, Weiliang Sound Machine Learning Multimedia Audio and Speech Processing We present Music102, an advanced model aimed at enhancing chord progression accompaniment through a $D_{12}$-equivariant transformer. Inspired by group theory and symbolic music structures, Music102 leverages musical symmetry--such as transposition and reflection operations--integrating these properties into the transformer architecture. By encoding prior music knowledge, the model maintains equivariance across both melody and chord sequences. The POP909 dataset was employed to train and evaluate Music102, revealing significant improvements over the non-equivariant Music101 prototype Music101 in both weighted loss and exact accuracy metrics, despite using fewer parameters. This work showcases the adaptability of self-attention mechanisms and layer normalization to the discrete musical domain, addressing challenges in computational music analysis. With its stable and flexible neural framework, Music102 sets the stage for further exploration in equivariant music generation and computational composition tools, bridging mathematical theory with practical music performance.
title	Music102: An $D_{12}$-equivariant transformer for chord progression accompaniment
topic	Sound Machine Learning Multimedia Audio and Speech Processing
url	https://arxiv.org/abs/2410.18151

Similar Items