:: Library Catalog

Bibliographic Details
Main Authors:	Nestor, Bret; Yao, Bohan; Moore, Jasmine; Kanes, Jasper
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Sound
Online Access:	https://arxiv.org/abs/2602.09295

Similar Items

ConceptCaps: a Distilled Concept Dataset for Interpretability in Music Models
by: Sienkiewicz, Bruno, et al.
Published: (2026)

Learning Interpretable Features in Audio Latent Spaces via Sparse Autoencoders
by: Paek, Nathan, et al.
Published: (2025)

LibriVAD: A Scalable Open Dataset with Deep Learning Benchmarks for Voice Activity Detection
by: Stylianou, Ioannis, et al.
Published: (2025)

A Dataset for Automatic Vocal Mode Classification
by: Hinrichs, Reemt, et al.
Published: (2026)

Histogram-based Parameter-efficient Tuning for Passive and Active Sonar Classification
by: Mohammadi, Amirmohammad, et al.
Published: (2025)

Adaptive Discovery of Interpretable Audio Attributes with Multimodal LLMs for Low-Resource Classification
by: Yoshimura, Kosuke, et al.
Published: (2026)

PosCUDA: Position based Convolution for Unlearnable Audio Datasets
by: Gokul, Vignesh, et al.
Published: (2024)

Parametric Neural Amp Modeling with Active Learning
by: Grötschla, Florian, et al.
Published: (2025)

Constructing Composite Features for Interpretable Music-Tagging
by: Xue, Chenhao, et al.
Published: (2026)

PianoCoRe: Combined and Refined Piano MIDI Dataset
by: Borovik, Ilya
Published: (2026)

Heterogeneity-Aware Dataset Scheduling for Efficient Audio Large Language Model Training
by: Wu, Yanru, et al.
Published: (2026)

Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish
by: Junczyk, Michał
Published: (2024)

Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
by: Cheddad, Zohra Adila, et al.
Published: (2021)

Aggregation Strategies for Efficient Annotation of Bioacoustic Sound Events Using Active Learning
by: Lindholm, Richard, et al.
Published: (2025)

Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
by: Feng, Pengchao, et al.
Published: (2025)

SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech
by: Sabra, Adam, et al.
Published: (2024)

Regularized Schrödinger Bridge: Alleviating Distortion and Exposure Bias in Solving Inverse Problems
by: Yao, Qing, et al.
Published: (2025)

From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning
by: Martinsson, John, et al.
Published: (2024)

Interpretable SHAP-bounded Bayesian Optimization for Underwater Acoustic Metamaterial Coating Design
by: Weeratunge, Hansani, et al.
Published: (2025)

Focal Modulation Networks for Interpretable Sound Classification
by: Della Libera, Luca, et al.
Published: (2024)

Meta-Learning-Based Delayless Subband Adaptive Filter using Complex Self-Attention for Active Noise Control
by: Feng, Pengxing, et al.
Published: (2024)

Of All StrIPEs: Investigating Structure-informed Positional Encoding for Efficient Music Generation
by: Agarwal, Manvi, et al.
Published: (2025)

Semantic-Aware Interpretable Multimodal Music Auto-Tagging
by: Patakis, Andreas, et al.
Published: (2025)

PACE: Pretrained Audio Continual Learning
by: Li, Chang, et al.
Published: (2026)

The iNaturalist Sounds Dataset
by: Chasmai, Mustafa, et al.
Published: (2025)

A Data-Centric Framework for Machine Listening Projects: Addressing Large-Scale Data Acquisition and Labeling through Active Learning
by: Naranjo-Alcazar, Javier, et al.
Published: (2024)

Music Genre Classification Using Machine Learning Techniques
by: Mishra, Alokit, et al.
Published: (2025)

A Machine Learning Approach for Denoising and Upsampling HRTFs
by: Hu, Xuyi, et al.
Published: (2025)

Discovering and Steering Interpretable Concepts in Large Generative Music Models
by: Singh, Nikhil, et al.
Published: (2025)

Advancing Marine Bioacoustics with Deep Generative Models: A Hybrid Augmentation Strategy for Southern Resident Killer Whale Detection
by: Padovese, Bruno, et al.
Published: (2025)

Prototypical Contrastive Learning For Improved Few-Shot Audio Classification
by: Sgouropoulos, Christos, et al.
Published: (2025)

Aria-MIDI: A Dataset of Piano MIDI Files for Symbolic Music Modeling
by: Bradshaw, Louis, et al.
Published: (2025)

AUDETER: A Large-scale Dataset for Deepfake Audio Detection in Open Worlds
by: Wang, Qizhou, et al.
Published: (2025)

Echo: Towards Advanced Audio Comprehension via Audio-Interleaved Reasoning
by: Wu, Daiqing, et al.
Published: (2026)

Efficient Continual Learning in Keyword Spotting using Binary Neural Networks
by: Vu, Quynh Nguyen-Phuong, et al.
Published: (2025)

Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection
by: Zhang, Shiqi, et al.
Published: (2025)

CyIN: Cyclic Informative Latent Space for Bridging Complete and Incomplete Multimodal Learning
by: Lin, Ronghao, et al.
Published: (2026)

A Multimodal Framework for Dementia Detection via Linguistic and Acoustic Representation Learning
by: Ilias, Loukas, et al.
Published: (2026)

Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
by: Gundluru, Ramesh, et al.
Published: (2025)

Clustering of Indonesian and Western Gamelan Orchestras through Machine Learning of Performance Parameters
by: Linke, Simon, et al.
Published: (2024)