| Main Authors: | Singh, Shubhr, Bhat, Kiran, Riley, Xavier, Resnick, Benjamin, Thickstun, John, De Brouwer, Walter |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.05399 |
Similar Items
LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
by: Singh, Shubhr, et al.
Published: (2025)
Aligning Text-to-Music Evaluation with Human Preferences
by: Huang, Yichen, et al.
Published: (2025)
Variable-Length Audio Fingerprinting
by: Chen, Hongjie, et al.
Published: (2026)
Assessing Factual Music Comprehension in Large Audio Language Models
by: Lin, Daniel Chenyu, et al.
Published: (2025)
Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification
by: Araz, R. Oguz, et al.
Published: (2025)
The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs
by: Sadok, Samir, et al.
Published: (2026)
Latent-Mark: An Audio Watermark Robust to Neural Resynthesis
by: Chen, Yen-Shan, et al.
Published: (2026)
Towards Assessing Data Replication in Music Generation with Music Similarity Metrics on Raw Audio
by: Batlle-Roca, Roser, et al.
Published: (2024)
AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech
by: Qiu, Jielin, et al.
Published: (2026)
Exploring How Audio Effects Alter Emotion with Foundation Models
by: Katsis, Stelios, et al.
Published: (2025)
Hookpad Aria: A Copilot for Songwriters
by: Donahue, Chris, et al.
Published: (2025)
S-SONDO: Self-Supervised Knowledge Distillation for General Audio Foundation Models
by: Adlouni, Mohammed Ali El, et al.
Published: (2026)
Estimating Musical Surprisal in Audio
by: Bjare, Mathias Rose, et al.
Published: (2025)
Generating Separated Singing Vocals Using a Diffusion Model Conditioned on Music Mixtures
by: Plaja-Roglans, Genís, et al.
Published: (2025)
Leveraging Pre-Trained Autoencoders for Interpretable Prototype Learning of Music Audio
by: Alonso-Jiménez, Pablo, et al.
Published: (2024)
Rebellion: Noise-Robust Reasoning Training for Audio Reasoning Models
by: Huang, Tiansheng, et al.
Published: (2025)
Pretrained Conformers for Audio Fingerprinting and Retrieval
by: Altwlkany, Kemal, et al.
Published: (2025)
Codec-Robust Attacks on Audio LLMs
by: Roh, Jaechul, et al.
Published: (2026)
Exploring Musical Roots: Applying Audio Embeddings to Empower Influence Attribution for a Generative Music Model
by: Barnett, Julia, et al.
Published: (2024)
Audio Explanation Synthesis with Generative Foundation Models
by: Akman, Alican, et al.
Published: (2024)
Perceptual Musical Features for Interpretable Audio Tagging
by: Lyberatos, Vassilis, et al.
Published: (2023)
ReMi: A Random Recurrent Neural Network Approach to Music Production
by: Chateau-Laurent, Hugo, et al.
Published: (2025)
Evaluating Neural Networks Architectures for Spring Reverb Modelling
by: Papaleo, Francesco, et al.
Published: (2024)
A Survey of Foundation Models for Music Understanding
by: Li, Wenjun, et al.
Published: (2024)
Advancing the Foundation Model for Music Understanding
by: Jiang, Yi, et al.
Published: (2025)
Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning
by: Tsai, Fang-Duo, et al.
Published: (2024)
UltraEval-Audio: A Unified Framework for Comprehensive Evaluation of Audio Foundation Models
by: Shi, Qundong, et al.
Published: (2026)
Are Audio-Language Models Listening? Audio-Specialist Heads for Adaptive Audio Steering
by: Glazer, Neta, et al.
Published: (2026)
Estimating Musical Surprisal from Audio in Autoregressive Diffusion Model Noise Spaces
by: Bjare, Mathias Rose, et al.
Published: (2025)
AnalysisGNN: Unified Music Analysis with Graph Neural Networks
by: Karystinaios, Emmanouil, et al.
Published: (2025)
Comparative Study of State-based Neural Networks for Virtual Analog Audio Effects Modeling
by: Simionato, Riccardo, et al.
Published: (2024)
Eureka-Audio: Triggering Audio Intelligence in Compact Language Models
by: Zhang, Dan, et al.
Published: (2026)
Music Style Transfer With Diffusion Model
by: Huang, Hong, et al.
Published: (2024)
Self Voice Conversion as an Attack against Neural Audio Watermarking
by: Özer, Yigitcan, et al.
Published: (2026)
Generative AI for Music and Audio
by: Dong, Hao-Wen
Published: (2024)
The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization
by: Zhang, Ruixing, et al.
Published: (2026)
Musical Score Understanding Benchmark: Evaluating Large Language Models' Comprehension of Complete Musical Scores
by: Dai, Congren, et al.
Published: (2025)
Modeling Music as a Time-Frequency Image: A 2D Tokenizer for Music Generation
by: Cheng, Yuqing, et al.
Published: (2026)
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
by: Ghosh, Sreyan, et al.
Published: (2026)
GraFPrint: A GNN-Based Approach for Audio Identification
by: Bhattacharjee, Aditya, et al.
Published: (2024)