| Main Authors: | Murgul, Sebastian, Schimper, Johannes, Heizmann, Michael |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.07973 |
Similar Items
Exploring Procedural Data Generation for Automatic Acoustic Guitar Fingerpicking Transcription
by: Murgul, Sebastian, et al.
Published: (2025)
Beat and Downbeat Tracking in Performance MIDI Using an End-to-End Transformer Architecture
by: Murgul, Sebastian, et al.
Published: (2025)
Fretting-Transformer: Encoder-Decoder Model for MIDI to Tablature Transcription
by: Hamberger, Anna, et al.
Published: (2025)
Beat-Based Rhythm Quantization of MIDI Performances
by: Wachter, Maximilian, et al.
Published: (2025)
Fine-Tuning MIDI-to-Audio Alignment using a Neural Network on Piano Roll and CQT Representations
by: Murgul, Sebastian, et al.
Published: (2025)
Transformer-Based Rhythm Quantization of Performance MIDI Using Beat Annotations
by: Wachter, Maximilian, et al.
Published: (2026)
Guitar-TECHS: An Electric Guitar Dataset Covering Techniques, Musical Excerpts, Chords and Scales Using a Diverse Array of Hardware
by: Pedroza, Hegel, et al.
Published: (2025)
Leveraging Real Electric Guitar Tones and Effects to Improve Robustness in Guitar Tablature Transcription Modeling
by: Pedroza, Hegel, et al.
Published: (2024)
Acoustically Precise Hesitation Tagging Is Essential for End-to-End Verbatim Transcription Systems
by: Lin, Jhen-Ke, et al.
Published: (2025)
Production and Manufacturing of 3D Printed Acoustic Guitars
by: Tran, Timothy, et al.
Published: (2025)
CALM: Joint Contextual Acoustic-Linguistic Modeling for Personalization of Multi-Speaker ASR
by: Shakeel, Muhammad, et al.
Published: (2026)
GAPS: A Large and Diverse Classical Guitar Dataset and Benchmark Transcription Model
by: Riley, Xavier, et al.
Published: (2024)
Towards Generalizability to Tone and Content Variations in the Transcription of Amplifier Rendered Electric Guitar Audio
by: Chen, Yu-Hua, et al.
Published: (2025)
audio2chart: End to End Audio Transcription into playable Guitar Hero charts
by: Tripodi, Riccardo
Published: (2025)
DDSP Guitar Amp: Interpretable Guitar Amplifier Modeling
by: Yeh, Yen-Tung, et al.
Published: (2024)
Infusing Acoustic Pause Context into Text-Based Dementia Assessment
by: Braun, Franziska, et al.
Published: (2024)
High Resolution Guitar Transcription via Domain Adaptation
by: Riley, Xavier, et al.
Published: (2024)
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection
by: Yosha, Iddo, et al.
Published: (2025)
Fotheidil: an Automatic Transcription System for the Irish Language
by: Lonergan, Liam, et al.
Published: (2024)
K-Function: Joint Pronunciation Transcription and Feedback for Evaluating Kids Language Function
by: Li, Shuhe, et al.
Published: (2025)
Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation
by: Hu, Rui, et al.
Published: (2025)
Mind the Gap: Entity-Preserved Context-Aware ASR Structured Transcriptions
by: Altinok, Duygu
Published: (2025)
Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model
by: Huang, Jiawen, et al.
Published: (2024)
Revisiting Acoustic Features for Robust ASR
by: Shah, Muhammad A., et al.
Published: (2024)
The Sound of Healthcare: Improving Medical Transcription ASR Accuracy with Large Language Models
by: Adedeji, Ayo, et al.
Published: (2024)
A Theoretical Framework for Acoustic Neighbor Embeddings
by: Jeon, Woojay
Published: (2024)
Exploring the Benefits of Tokenization of Discrete Acoustic Units
by: Dekel, Avihu, et al.
Published: (2024)
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
by: Zhuo, Le, et al.
Published: (2023)
Exploring Spoken Language Identification Strategies for Automatic Transcription of Multilingual Broadcast and Institutional Speech
by: Valente, Martina, et al.
Published: (2024)
Salmon: A Suite for Acoustic Language Model Evaluation
by: Maimon, Gallil, et al.
Published: (2024)
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages
by: Liang, Siyu, et al.
Published: (2025)
Universal Acoustic Adversarial Attacks for Flexible Control of Speech-LLMs
by: Ma, Rao, et al.
Published: (2025)
A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework
by: Nan, Zheng, et al.
Published: (2024)
Guitar Pickups I: Analysis of the Effect of Winding and Wire Gauge on Single Coil Electric Guitar Pickups
by: Batchelor, Charles, et al.
Published: (2024)
Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context
by: Ao, Junyi, et al.
Published: (2025)
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation
by: Wei, Victor Junqiu, et al.
Published: (2024)
From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data
by: Attia, Ahmed Adel, et al.
Published: (2025)
Voice Conversion for Lombard Speaking Style with Implicit and Explicit Acoustic Feature Conditioning
by: Woszczyk, Dominika, et al.
Published: (2025)
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces
by: Chang, Heng-Jui, et al.
Published: (2023)
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models
by: Raina, Vyas, et al.
Published: (2024)