Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lim, Wen Qing, Liang, Jinhua, Zhang, Huan
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2409.08155
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912024983764992
author	Lim, Wen Qing Liang, Jinhua Zhang, Huan
author_facet	Lim, Wen Qing Liang, Jinhua Zhang, Huan
contents	Music is inherently made up of complex structures, and representing them as graphs helps to capture multiple levels of relationships. While music generation has been explored using various deep generation techniques, research on graph-related music generation is sparse. Earlier graph-based music generation worked only on generating melodies, and recent works to generate polyphonic music do not account for longer-term structure. In this paper, we explore a multi-graph approach to represent both the rhythmic patterns and phrase structure of Chinese pop music. Consequently, we propose a two-step approach that aims to generate polyphonic music with coherent rhythm and long-term structure. We train two Variational Auto-Encoder networks - one on a MIDI dataset to generate 4-bar phrases, and another on song structure labels to generate full song structure. Our work shows that the models are able to learn most of the structural nuances in the training dataset, including chord and pitch frequency distributions, and phrase attributes.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_08155
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Hierarchical Symbolic Pop Music Generation with Graph Neural Networks Lim, Wen Qing Liang, Jinhua Zhang, Huan Audio and Speech Processing Music is inherently made up of complex structures, and representing them as graphs helps to capture multiple levels of relationships. While music generation has been explored using various deep generation techniques, research on graph-related music generation is sparse. Earlier graph-based music generation worked only on generating melodies, and recent works to generate polyphonic music do not account for longer-term structure. In this paper, we explore a multi-graph approach to represent both the rhythmic patterns and phrase structure of Chinese pop music. Consequently, we propose a two-step approach that aims to generate polyphonic music with coherent rhythm and long-term structure. We train two Variational Auto-Encoder networks - one on a MIDI dataset to generate 4-bar phrases, and another on song structure labels to generate full song structure. Our work shows that the models are able to learn most of the structural nuances in the training dataset, including chord and pitch frequency distributions, and phrase attributes.
title	Hierarchical Symbolic Pop Music Generation with Graph Neural Networks
topic	Audio and Speech Processing
url	https://arxiv.org/abs/2409.08155

Similar Items