Saved in:
| Main Authors: | Bukey, Irmak, Wang, Zhepei, Donahue, Chris, Bryan, Nicholas J. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.03023 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Just Label the Repeats for In-The-Wild Audio-to-Score Alignment
by: Bukey, Irmak, et al.
Published: (2024)
by: Bukey, Irmak, et al.
Published: (2024)
Anticipatory Music Transformer
by: Thickstun, John, et al.
Published: (2023)
by: Thickstun, John, et al.
Published: (2023)
Do Music Generation Models Encode Music Theory?
by: Wei, Megan, et al.
Published: (2024)
by: Wei, Megan, et al.
Published: (2024)
Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio
by: Jung, Jongmin, et al.
Published: (2025)
by: Jung, Jongmin, et al.
Published: (2025)
Stemphonic: All-at-once Flexible Multi-stem Music Generation
by: Wu, Shih-Lun, et al.
Published: (2026)
by: Wu, Shih-Lun, et al.
Published: (2026)
Investigating Modality Contribution in Audio LLMs for Music
by: Morais, Giovana, et al.
Published: (2025)
by: Morais, Giovana, et al.
Published: (2025)
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation
by: Novack, Zachary, et al.
Published: (2024)
by: Novack, Zachary, et al.
Published: (2024)
Musical Attention Transformer: Music Generation Using a Music-Specific Attention Model
by: Taksuka, Shinnosuke, et al.
Published: (2026)
by: Taksuka, Shinnosuke, et al.
Published: (2026)
DITTO: Diffusion Inference-Time T-Optimization for Music Generation
by: Novack, Zachary, et al.
Published: (2024)
by: Novack, Zachary, et al.
Published: (2024)
Survey on the Evaluation of Generative Models in Music
by: Lerch, Alexander, et al.
Published: (2025)
by: Lerch, Alexander, et al.
Published: (2025)
Live Music Models
by: Lyria Team, et al.
Published: (2025)
by: Lyria Team, et al.
Published: (2025)
Detecting Musical Deepfakes
by: Sunday, Nick
Published: (2025)
by: Sunday, Nick
Published: (2025)
Presto! Distilling Steps and Layers for Accelerating Music Generation
by: Novack, Zachary, et al.
Published: (2024)
by: Novack, Zachary, et al.
Published: (2024)
Sound and Music Biases in Deep Music Transcription Models: A Systematic Analysis
by: Marták, Lukáš Samuel, et al.
Published: (2025)
by: Marták, Lukáš Samuel, et al.
Published: (2025)
Music Transcription with (Almost) No Supervision
by: Shin, Saebyeol, et al.
Published: (2026)
by: Shin, Saebyeol, et al.
Published: (2026)
Source Separation for A Cappella Music
by: Lanzendörfer, Luca A., et al.
Published: (2025)
by: Lanzendörfer, Luca A., et al.
Published: (2025)
MusicRL: Aligning Music Generation to Human Preferences
by: Cideron, Geoffrey, et al.
Published: (2024)
by: Cideron, Geoffrey, et al.
Published: (2024)
V2Meow: Meowing to the Visual Beat via Video-to-Music Generation
by: Su, Kun, et al.
Published: (2023)
by: Su, Kun, et al.
Published: (2023)
Bangla Music Genre Classification Using Bidirectional LSTMS
by: Rahaman, Muntakimur, et al.
Published: (2026)
by: Rahaman, Muntakimur, et al.
Published: (2026)
Music Genre Classification Using Machine Learning Techniques
by: Mishra, Alokit, et al.
Published: (2025)
by: Mishra, Alokit, et al.
Published: (2025)
MusicLIME: Explainable Multimodal Music Understanding
by: Sotirou, Theodoros, et al.
Published: (2024)
by: Sotirou, Theodoros, et al.
Published: (2024)
Depth-Structured Music Recurrence: Budgeted Recurrent Attention for Full-Piece Symbolic Music Modeling
by: Yi, Yungang, et al.
Published: (2026)
by: Yi, Yungang, et al.
Published: (2026)
Constructing Composite Features for Interpretable Music-Tagging
by: Xue, Chenhao, et al.
Published: (2026)
by: Xue, Chenhao, et al.
Published: (2026)
V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation
by: Lin, Yan-Bo, et al.
Published: (2026)
by: Lin, Yan-Bo, et al.
Published: (2026)
A Study on the Data Distribution Gap in Music Emotion Recognition
by: Ching, Joann, et al.
Published: (2025)
by: Ching, Joann, et al.
Published: (2025)
Bias beyond Borders: Global Inequalities in AI-Generated Music
by: Solak, Ahmet, et al.
Published: (2025)
by: Solak, Ahmet, et al.
Published: (2025)
Count The Notes: Histogram-Based Supervision for Automatic Music Transcription
by: Yaffe, Jonathan, et al.
Published: (2025)
by: Yaffe, Jonathan, et al.
Published: (2025)
High-Fidelity Music Vocoder using Neural Audio Codecs
by: Lanzendörfer, Luca A., et al.
Published: (2025)
by: Lanzendörfer, Luca A., et al.
Published: (2025)
ProGress: Structured Music Generation via Graph Diffusion and Hierarchical Music Analysis
by: Ni-Hahn, Stephen, et al.
Published: (2025)
by: Ni-Hahn, Stephen, et al.
Published: (2025)
Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music
by: Tunturi, Eetu, et al.
Published: (2025)
by: Tunturi, Eetu, et al.
Published: (2025)
Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces
by: Atassi, Lilac
Published: (2024)
by: Atassi, Lilac
Published: (2024)
Music Arena: Live Evaluation for Text-to-Music
by: Kim, Yonghyun, et al.
Published: (2025)
by: Kim, Yonghyun, et al.
Published: (2025)
On Class Separability Pitfalls In Audio-Text Contrastive Zero-Shot Learning
by: Tavares, Tiago, et al.
Published: (2024)
by: Tavares, Tiago, et al.
Published: (2024)
Music Foundation Model as Generic Booster for Music Downstream Tasks
by: Liao, WeiHsiang, et al.
Published: (2024)
by: Liao, WeiHsiang, et al.
Published: (2024)
Benchmarking Music Generation Models and Metrics via Human Preference Studies
by: Grötschla, Florian, et al.
Published: (2025)
by: Grötschla, Florian, et al.
Published: (2025)
Live Music Diffusion Models: Efficient Fine-Tuning and Post-Training of Interactive Diffusion Music Generators
by: Novack, Zachary, et al.
Published: (2026)
by: Novack, Zachary, et al.
Published: (2026)
Brain2Music: Reconstructing Music from Human Brain Activity
by: Denk, Timo I., et al.
Published: (2023)
by: Denk, Timo I., et al.
Published: (2023)
Universal Music Representations? Evaluating Foundation Models on World Music Corpora
by: Papaioannou, Charilaos, et al.
Published: (2025)
by: Papaioannou, Charilaos, et al.
Published: (2025)
Predicting User Intents and Musical Attributes from Music Discovery Conversations
by: Kwon, Daeyong, et al.
Published: (2024)
by: Kwon, Daeyong, et al.
Published: (2024)
Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio
by: Long, Phillip, et al.
Published: (2026)
by: Long, Phillip, et al.
Published: (2026)
Similar Items
-
Just Label the Repeats for In-The-Wild Audio-to-Score Alignment
by: Bukey, Irmak, et al.
Published: (2024) -
Anticipatory Music Transformer
by: Thickstun, John, et al.
Published: (2023) -
Do Music Generation Models Encode Music Theory?
by: Wei, Megan, et al.
Published: (2024) -
Unified Cross-modal Translation of Score Images, Symbolic Music, and Performance Audio
by: Jung, Jongmin, et al.
Published: (2025) -
Stemphonic: All-at-once Flexible Multi-stem Music Generation
by: Wu, Shih-Lun, et al.
Published: (2026)