Saved in:
Bibliographic Details
Main Authors: Edwards, Drew, Riley, Xavier, Sarmento, Pedro, Dixon, Simon
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.05024
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909282822258688
author Edwards, Drew
Riley, Xavier
Sarmento, Pedro
Dixon, Simon
author_facet Edwards, Drew
Riley, Xavier
Sarmento, Pedro
Dixon, Simon
contents Guitar tablatures enrich the structure of traditional music notation by assigning each note to a string and fret of a guitar in a particular tuning, indicating precisely where to play the note on the instrument. The problem of generating tablature from a symbolic music representation involves inferring this string and fret assignment per note across an entire composition or performance. On the guitar, multiple string-fret assignments are possible for most pitches, which leads to a large combinatorial space that prevents exhaustive search approaches. Most modern methods use constraint-based dynamic programming to minimize some cost function (e.g.\ hand position movement). In this work, we introduce a novel deep learning solution to symbolic guitar tablature estimation. We train an encoder-decoder Transformer model in a masked language modeling paradigm to assign notes to strings. The model is first pre-trained on DadaGP, a dataset of over 25K tablatures, and then fine-tuned on a curated set of professionally transcribed guitar performances. Given the subjective nature of assessing tablature quality, we conduct a user study amongst guitarists, wherein we ask participants to rate the playability of multiple versions of tablature for the same four-bar excerpt. The results indicate our system significantly outperforms competing algorithms.
format Preprint
id arxiv_https___arxiv_org_abs_2408_05024
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling
Edwards, Drew
Riley, Xavier
Sarmento, Pedro
Dixon, Simon
Sound
Computation and Language
Information Retrieval
Guitar tablatures enrich the structure of traditional music notation by assigning each note to a string and fret of a guitar in a particular tuning, indicating precisely where to play the note on the instrument. The problem of generating tablature from a symbolic music representation involves inferring this string and fret assignment per note across an entire composition or performance. On the guitar, multiple string-fret assignments are possible for most pitches, which leads to a large combinatorial space that prevents exhaustive search approaches. Most modern methods use constraint-based dynamic programming to minimize some cost function (e.g.\ hand position movement). In this work, we introduce a novel deep learning solution to symbolic guitar tablature estimation. We train an encoder-decoder Transformer model in a masked language modeling paradigm to assign notes to strings. The model is first pre-trained on DadaGP, a dataset of over 25K tablatures, and then fine-tuned on a curated set of professionally transcribed guitar performances. Given the subjective nature of assessing tablature quality, we conduct a user study amongst guitarists, wherein we ask participants to rate the playability of multiple versions of tablature for the same four-bar excerpt. The results indicate our system significantly outperforms competing algorithms.
title MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling
topic Sound
Computation and Language
Information Retrieval
url https://arxiv.org/abs/2408.05024