Saved in:
Bibliographic Details
Main Authors: Lim, Wen Qing, Liang, Jinhua, Zhang, Huan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.08155
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912024983764992
author Lim, Wen Qing
Liang, Jinhua
Zhang, Huan
author_facet Lim, Wen Qing
Liang, Jinhua
Zhang, Huan
contents Music is inherently made up of complex structures, and representing them as graphs helps to capture multiple levels of relationships. While music generation has been explored using various deep generation techniques, research on graph-related music generation is sparse. Earlier graph-based music generation worked only on generating melodies, and recent works to generate polyphonic music do not account for longer-term structure. In this paper, we explore a multi-graph approach to represent both the rhythmic patterns and phrase structure of Chinese pop music. Consequently, we propose a two-step approach that aims to generate polyphonic music with coherent rhythm and long-term structure. We train two Variational Auto-Encoder networks - one on a MIDI dataset to generate 4-bar phrases, and another on song structure labels to generate full song structure. Our work shows that the models are able to learn most of the structural nuances in the training dataset, including chord and pitch frequency distributions, and phrase attributes.
format Preprint
id arxiv_https___arxiv_org_abs_2409_08155
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Hierarchical Symbolic Pop Music Generation with Graph Neural Networks
Lim, Wen Qing
Liang, Jinhua
Zhang, Huan
Audio and Speech Processing
Music is inherently made up of complex structures, and representing them as graphs helps to capture multiple levels of relationships. While music generation has been explored using various deep generation techniques, research on graph-related music generation is sparse. Earlier graph-based music generation worked only on generating melodies, and recent works to generate polyphonic music do not account for longer-term structure. In this paper, we explore a multi-graph approach to represent both the rhythmic patterns and phrase structure of Chinese pop music. Consequently, we propose a two-step approach that aims to generate polyphonic music with coherent rhythm and long-term structure. We train two Variational Auto-Encoder networks - one on a MIDI dataset to generate 4-bar phrases, and another on song structure labels to generate full song structure. Our work shows that the models are able to learn most of the structural nuances in the training dataset, including chord and pitch frequency distributions, and phrase attributes.
title Hierarchical Symbolic Pop Music Generation with Graph Neural Networks
topic Audio and Speech Processing
url https://arxiv.org/abs/2409.08155