| Main Authors: | Ding, Yiwei, Lerch, Alexander |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2411.19371 |
Similar Items
Uncertainty Estimation in the Real World: A Study on Music Emotion Recognition
by: Watcharasupat, Karn N., et al.
Published: (2025)
Separate This, and All of these Things Around It: Music Source Separation via Hyperellipsoidal Queries
by: Watcharasupat, Karn N., et al.
Published: (2025)
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems
by: Watcharasupat, Karn N., et al.
Published: (2024)
Music auto-tagging in the long tail: A few-shot approach
by: Ma, T. Aleksandra, et al.
Published: (2024)
A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation
by: Watcharasupat, Karn N., et al.
Published: (2023)
Tune It Up: Music Genre Transfer and Prediction
by: Samet, Fidan, et al.
Published: (2025)
Do Foundational Audio Encoders Understand Music Structure?
by: Toyama, Keisuke, et al.
Published: (2025)
Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music
by: Kim, Alexander, et al.
Published: (2025)
Mathematical Foundations of Polyphonic Music Generation via Structural Inductive Bias
by: Seo, Joonwon
Published: (2026)
Music Boomerang: Reusing Diffusion Models for Data Augmentation and Audio Manipulation
by: Fichtinger, Alexander, et al.
Published: (2025)
Music Foundation Model as Generic Booster for Music Downstream Tasks
by: Liao, WeiHsiang, et al.
Published: (2024)
Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors
by: Han, Chaeyeon, et al.
Published: (2024)
Universal Music Representations? Evaluating Foundation Models on World Music Corpora
by: Papaioannou, Charilaos, et al.
Published: (2025)
Integrating Text-to-Music Models with Language Models: Composing Long Structured Music Pieces
by: Atassi, Lilac
Published: (2024)
Learning Music Audio Representations With Limited Data
by: Plachouras, Christos, et al.
Published: (2025)
Towards Robust Transcription: Exploring Noise Injection Strategies for Training Data Augmentation
by: Kim, Yonghyun, et al.
Published: (2024)
Watermarking Training Data of Music Generation Models
by: Epple, Pascal, et al.
Published: (2024)
Online Symbolic Music Alignment with Offline Reinforcement Learning
by: Peter, Silvan David
Published: (2023)
MusicRL: Aligning Music Generation to Human Preferences
by: Cideron, Geoffrey, et al.
Published: (2024)
E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models
by: Dong, Jiaheng, et al.
Published: (2025)
COCOLA: Coherence-Oriented Contrastive Learning of Musical Audio Representations
by: Ciranni, Ruben, et al.
Published: (2024)
Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds
by: Chang, Andrew, et al.
Published: (2025)
Anticipatory Music Transformer
by: Thickstun, John, et al.
Published: (2023)
Discovering and Steering Interpretable Concepts in Large Generative Music Models
by: Singh, Nikhil, et al.
Published: (2025)
Multi-Source Diffusion Models for Simultaneous Music Generation and Separation
by: Mariani, Giorgio, et al.
Published: (2023)
LLark: A Multimodal Instruction-Following Language Model for Music
by: Gardner, Josh, et al.
Published: (2023)
Revisiting Meter Tracking in Carnatic Music using Deep Learning Approaches
by: Prabhu, Satyajeet
Published: (2025)
Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation
by: Lashkarashvili, Nineli, et al.
Published: (2024)
ProGress: Structured Music Generation via Graph Diffusion and Hierarchical Music Analysis
by: Ni-Hahn, Stephen, et al.
Published: (2025)
Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music
by: Tunturi, Eetu, et al.
Published: (2025)
Generalized Multi-Source Inference for Text Conditioned Music Diffusion Models
by: Postolache, Emilian, et al.
Published: (2024)
PerTok: Expressive Encoding and Modeling of Symbolic Musical Ideas and Variations
by: Lenz, Julian, et al.
Published: (2024)
Subtractive Training for Music Stem Insertion using Latent Diffusion Models
by: Villa-Renteria, Ivan, et al.
Published: (2024)
AI-Assisted Music Production: A User Study on Text-to-Music Models
by: Ronchini, Francesca, et al.
Published: (2025)
LC-Protonets: Multi-Label Few-Shot Learning for World Music Audio Tagging
by: Papaioannou, Charilaos, et al.
Published: (2024)
Naturalistic Music Decoding from EEG Data via Latent Diffusion Models
by: Postolache, Emilian, et al.
Published: (2024)
UniPET-SPK: A Unified Framework for Parameter-Efficient Tuning of Pre-trained Speech Models for Robust Speaker Verification
by: Sang, Mufan, et al.
Published: (2025)
Audio-Based Pedestrian Detection in the Presence of Vehicular Noise
by: Kim, Yonghyun, et al.
Published: (2025)
SCORE-SET: A dataset of GuitarPro files for Music Phrase Generation and Sequence Learning
by: Begari, Vishakh
Published: (2025)
AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model
by: Komiya, Kazuma, et al.
Published: (2024)