Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Paroiu, Razvan, Trausan-Matu, Stefan
Format:	Preprint
Published:	2025
Subjects:	Sound Artificial Intelligence Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2504.02586
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910902714892288
author	Paroiu, Razvan Trausan-Matu, Stefan
author_facet	Paroiu, Razvan Trausan-Matu, Stefan
contents	This paper introduces four different artificial intelligence algorithms for music generation and aims to compare these methods not only based on the aesthetic quality of the generated music but also on their suitability for specific applications. The first set of melodies is produced by a slightly modified visual transformer neural network that is used as a language model. The second set of melodies is generated by combining chat sonification with a classic transformer neural network (the same method of music generation is presented in a previous research), the third set of melodies is generated by combining the Schillinger rhythm theory together with a classic transformer neural network, and the fourth set of melodies is generated using GPT3 transformer provided by OpenAI. A comparative analysis is performed on the melodies generated by these approaches and the results indicate that significant differences can be observed between them and regarding the aesthetic value of them, GPT3 produced the most pleasing melodies, and the newly introduced Schillinger method proved to generate better sounding music than previous sonification methods.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_02586
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Deep learning for music generation. Four approaches and their comparative evaluation Paroiu, Razvan Trausan-Matu, Stefan Sound Artificial Intelligence Audio and Speech Processing This paper introduces four different artificial intelligence algorithms for music generation and aims to compare these methods not only based on the aesthetic quality of the generated music but also on their suitability for specific applications. The first set of melodies is produced by a slightly modified visual transformer neural network that is used as a language model. The second set of melodies is generated by combining chat sonification with a classic transformer neural network (the same method of music generation is presented in a previous research), the third set of melodies is generated by combining the Schillinger rhythm theory together with a classic transformer neural network, and the fourth set of melodies is generated using GPT3 transformer provided by OpenAI. A comparative analysis is performed on the melodies generated by these approaches and the results indicate that significant differences can be observed between them and regarding the aesthetic value of them, GPT3 produced the most pleasing melodies, and the newly introduced Schillinger method proved to generate better sounding music than previous sonification methods.
title	Deep learning for music generation. Four approaches and their comparative evaluation
topic	Sound Artificial Intelligence Audio and Speech Processing
url	https://arxiv.org/abs/2504.02586

Similar Items