Saved in:
Bibliographic Details
Main Authors: Paroiu, Razvan, Trausan-Matu, Stefan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.02586
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910902714892288
author Paroiu, Razvan
Trausan-Matu, Stefan
author_facet Paroiu, Razvan
Trausan-Matu, Stefan
contents This paper introduces four different artificial intelligence algorithms for music generation and aims to compare these methods not only based on the aesthetic quality of the generated music but also on their suitability for specific applications. The first set of melodies is produced by a slightly modified visual transformer neural network that is used as a language model. The second set of melodies is generated by combining chat sonification with a classic transformer neural network (the same method of music generation is presented in a previous research), the third set of melodies is generated by combining the Schillinger rhythm theory together with a classic transformer neural network, and the fourth set of melodies is generated using GPT3 transformer provided by OpenAI. A comparative analysis is performed on the melodies generated by these approaches and the results indicate that significant differences can be observed between them and regarding the aesthetic value of them, GPT3 produced the most pleasing melodies, and the newly introduced Schillinger method proved to generate better sounding music than previous sonification methods.
format Preprint
id arxiv_https___arxiv_org_abs_2504_02586
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Deep learning for music generation. Four approaches and their comparative evaluation
Paroiu, Razvan
Trausan-Matu, Stefan
Sound
Artificial Intelligence
Audio and Speech Processing
This paper introduces four different artificial intelligence algorithms for music generation and aims to compare these methods not only based on the aesthetic quality of the generated music but also on their suitability for specific applications. The first set of melodies is produced by a slightly modified visual transformer neural network that is used as a language model. The second set of melodies is generated by combining chat sonification with a classic transformer neural network (the same method of music generation is presented in a previous research), the third set of melodies is generated by combining the Schillinger rhythm theory together with a classic transformer neural network, and the fourth set of melodies is generated using GPT3 transformer provided by OpenAI. A comparative analysis is performed on the melodies generated by these approaches and the results indicate that significant differences can be observed between them and regarding the aesthetic value of them, GPT3 produced the most pleasing melodies, and the newly introduced Schillinger method proved to generate better sounding music than previous sonification methods.
title Deep learning for music generation. Four approaches and their comparative evaluation
topic Sound
Artificial Intelligence
Audio and Speech Processing
url https://arxiv.org/abs/2504.02586