:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shin, Saebyeol, Wan, Chao, Liu, Zhenzhen, Lovelace, Justin, Lin, Daniel C., Weinberger, Kilian Q., Thickstun, John
Format:	Preprint
Published:	2026
Subjects:	Sound Machine Learning
Online Access:	https://arxiv.org/abs/2605.24193
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Sample-Efficient Diffusion for Text-To-Speech Synthesis
by: Lovelace, Justin, et al.
Published: (2024)

Assessing Factual Music Comprehension in Large Audio Language Models
by: Lin, Daniel Chenyu, et al.
Published: (2025)

Anticipatory Music Transformer
by: Thickstun, John, et al.
Published: (2023)

Count The Notes: Histogram-Based Supervision for Automatic Music Transcription
by: Yaffe, Jonathan, et al.
Published: (2025)

IncDSI: Incrementally Updatable Document Retrieval
by: Kishore, Varsha, et al.
Published: (2023)

Diffusion Guided Language Modeling
by: Lovelace, Justin, et al.
Published: (2024)

Sound and Music Biases in Deep Music Transcription Models: A Systematic Analysis
by: Marták, Lukáš Samuel, et al.
Published: (2025)

Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond
by: Wang, Qizhou, et al.
Published: (2025)

Adaptive Moments are Surprisingly Effective for Plug-and-Play Diffusion Sampling
by: Belardi, Christian, et al.
Published: (2026)

Prescriptive Scaling Laws for Data Constrained Training
by: Lovelace, Justin, et al.
Published: (2026)

Hookpad Aria: A Copilot for Songwriters
by: Donahue, Chris, et al.
Published: (2025)

Quantifying the Corpus Bias Problem in Automatic Music Transcription Systems
by: Marták, Lukáš Samuel, et al.
Published: (2024)

Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription
by: Cwitkowitz, Frank, et al.
Published: (2023)

AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model
by: Komiya, Kazuma, et al.
Published: (2024)

Stop-Think-AutoRegress: Language Modeling with Latent Diffusion Planning
by: Lovelace, Justin, et al.
Published: (2026)

SpeechOp: Inference-Time Task Composition for Generative Speech Processing
by: Lovelace, Justin, et al.
Published: (2025)

Machine Learning Techniques in Automatic Music Transcription: A Systematic Survey
by: Jamshidi, Fatemeh, et al.
Published: (2024)

Rethinking Music Captioning with Music Metadata LLMs
by: Bukey, Irmak, et al.
Published: (2026)

Musical Attention Transformer: Music Generation Using a Music-Specific Attention Model
by: Taksuka, Shinnosuke, et al.
Published: (2026)

DiffRoll: Diffusion-based Generative Music Transcription with Unsupervised Pretraining Capability
by: Cheuk, Kin Wai, et al.
Published: (2022)

YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
by: Chang, Sungkyun, et al.
Published: (2024)

Detecting Musical Deepfakes
by: Sunday, Nick
Published: (2025)

Automatic Music Transcription using Convolutional Neural Networks and Constant-Q transform
by: Telila, Yohannis, et al.
Published: (2025)

ProGress: Structured Music Generation via Graph Diffusion and Hierarchical Music Analysis
by: Ni-Hahn, Stephen, et al.
Published: (2025)

Source Separation for A Cappella Music
by: Lanzendörfer, Luca A., et al.
Published: (2025)

VioPTT: Violin Technique-Aware Transcription from Synthetic Data Augmentation
by: Wang, Ting-Kang, et al.
Published: (2025)

Linear Complexity Self-Supervised Learning for Music Understanding with Random Quantizer
by: Vavaroutsos, Petros, et al.
Published: (2026)

RUMAA: Repeat-Aware Unified Music Audio Analysis for Score-Performance Alignment, Transcription, and Mistake Detection
by: Chang, Sungkyun, et al.
Published: (2025)

Investigating Modality Contribution in Audio LLMs for Music
by: Morais, Giovana, et al.
Published: (2025)

MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage
by: Tan, Hao Hao, et al.
Published: (2024)

Bangla Music Genre Classification Using Bidirectional LSTMS
by: Rahaman, Muntakimur, et al.
Published: (2026)

Music Genre Classification Using Machine Learning Techniques
by: Mishra, Alokit, et al.
Published: (2025)

A Study on the Data Distribution Gap in Music Emotion Recognition
by: Ching, Joann, et al.
Published: (2025)

Bias beyond Borders: Global Inequalities in AI-Generated Music
by: Solak, Ahmet, et al.
Published: (2025)

High-Fidelity Music Vocoder using Neural Audio Codecs
by: Lanzendörfer, Luca A., et al.
Published: (2025)

Benchmarking Music Generation Models and Metrics via Human Preference Studies
by: Grötschla, Florian, et al.
Published: (2025)

Large Language Models' Internal Perception of Symbolic Music
by: Shin, Andrew, et al.
Published: (2025)

Multi-Source Music Generation with Latent Diffusion
by: Xu, Zhongweiyang, et al.
Published: (2024)

SAGE-Music: Low-Latency Symbolic Music Generation via Attribute-Specialized Key-Value Head Sharing
by: Tan, Jiaye, et al.
Published: (2025)

Constructing Composite Features for Interpretable Music-Tagging
by: Xue, Chenhao, et al.
Published: (2026)