| Main Authors: | Nestor, Bret, Yao, Bohan, Moore, Jasmine, Kanes, Jasper |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.09295 |
Similar Items
ConceptCaps: a Distilled Concept Dataset for Interpretability in Music Models
by: Sienkiewicz, Bruno, et al.
Published: (2026)
Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders
by: Paek, Nathan, et al.
Published: (2025)
LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection
by: Stylianou, Ioannis, et al.
Published: (2025)
A Dataset for Automatic Vocal Mode Classification
by: Hinrichs, Reemt, et al.
Published: (2026)
Histogram-based Parameter-efficient Tuning for Passive and Active Sonar Classification
by: Mohammadi, Amirmohammad, et al.
Published: (2025)
Adaptive Discovery of Interpretable Audio Attributes with Multimodal LLMs for Low-Resource Classification
by: Yoshimura, Kosuke, et al.
Published: (2026)
PosCUDA: Position based Convolution for Unlearnable Audio Datasets
by: Gokul, Vignesh, et al.
Published: (2024)
Parametric Neural Amp Modeling with Active Learning
by: Grötschla, Florian, et al.
Published: (2025)
Constructing Composite Features for Interpretable Music-Tagging
by: Xue, Chenhao, et al.
Published: (2026)
PianoCoRe: Combined and Refined Piano MIDI Dataset
by: Borovik, Ilya
Published: (2026)
Heterogeneity-Aware Dataset Scheduling for Efficient Audio Large Language Model Training
by: Wu, Yanru, et al.
Published: (2026)
Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish
by: Junczyk, Michał
Published: (2024)
Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
by: Cheddad, Zohra Adila, et al.
Published: (2021)
Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning
by: Lindholm, Richard, et al.
Published: (2025)
Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
by: Feng, Pengchao, et al.
Published: (2025)
SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech
by: Sabra, Adam, et al.
Published: (2024)
Regularized Schrödinger Bridge: Alleviating Distortion and Exposure Bias in Solving Inverse Problems
by: Yao, Qing, et al.
Published: (2025)
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning
by: Martinsson, John, et al.
Published: (2024)
Interpretable SHAP-bounded Bayesian Optimization for Underwater Acoustic Metamaterial Coating Design
by: Weeratunge, Hansani, et al.
Published: (2025)
Focal Modulation Networks for Interpretable Sound Classification
by: Della Libera, Luca, et al.
Published: (2024)
Meta-Learning-Based Delayless Subband Adaptive Filter using Complex Self-Attention for Active Noise Control
by: Feng, Pengxing, et al.
Published: (2024)
Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation
by: Agarwal, Manvi, et al.
Published: (2025)
Semantic-Aware Interpretable Multimodal Music Auto-Tagging
by: Patakis, Andreas, et al.
Published: (2025)
PACE: Pretrained Audio Continual Learning
by: Li, Chang, et al.
Published: (2026)
The iNaturalist Sounds Dataset
by: Chasmai, Mustafa, et al.
Published: (2025)
A Data-Centric Framework for Machine Listening Projects: Addressing Large-Scale Data Acquisition and Labeling through Active Learning
by: Naranjo-Alcazar, Javier, et al.
Published: (2024)
Music Genre Classification Using Machine Learning Techniques
by: Mishra, Alokit, et al.
Published: (2025)
A Machine Learning Approach for Denoising and Upsampling HRTFs
by: Hu, Xuyi, et al.
Published: (2025)
Discovering and Steering Interpretable Concepts in Large Generative Music Models
by: Singh, Nikhil, et al.
Published: (2025)
Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection
by: Padovese, Bruno, et al.
Published: (2025)
Prototypical Contrastive Learning For Improved Few-Shot Audio Classification
by: Sgouropoulos, Christos, et al.
Published: (2025)
Aria-MIDI: A Dataset of Piano MIDI Files for Symbolic Music Modeling
by: Bradshaw, Louis, et al.
Published: (2025)
AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds
by: Wang, Qizhou, et al.
Published: (2025)
Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning
by: Wu, Daiqing, et al.
Published: (2026)
Efficient Continual Learning in Keyword Spotting using Binary Neural Networks
by: Vu, Quynh Nguyen-Phuong, et al.
Published: (2025)
Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection
by: Zhang, Shiqi, et al.
Published: (2025)
CyIN: Cyclic Informative Latent Space for Bridging Complete and Incomplete Multimodal Learning
by: Lin, Ronghao, et al.
Published: (2026)
A Multimodal Framework for Dementia Detection via Linguistic and Acoustic Representation Learning
by: Ilias, Loukas, et al.
Published: (2026)
Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
by: Gundluru, Ramesh, et al.
Published: (2025)
Clustering of Indonesian and Western Gamelan Orchestras through Machine Learning of Performance Parameters
by: Linke, Simon, et al.
Published: (2024)