:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Haider, Daniel, Perfler, Felix, Balazs, Peter, Hollomey, Clara, Holighaus, Nicki
Format:	Preprint
Published:	2025
Subjects:	Sound Machine Learning
Online Access:	https://arxiv.org/abs/2505.07709
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
by: Haider, Daniel, et al.
Published: (2024)

Phase-Based Signal Representations for Scattering
by: Haider, Daniel, et al.
Published: (2022)

Aliasing in Convnets: A Frame-Theoretic Perspective
by: Haider, Daniel, et al.
Published: (2025)

Fitting Auditory Filterbanks with Multiresolution Neural Networks
by: Lostanlen, Vincent, et al.
Published: (2023)

Instabilities in Convnets for Raw Audio
by: Haider, Daniel, et al.
Published: (2023)

SELEBI: Percussion-aware Time Stretching via Selective Magnitude Spectrogram Compression by Nonstationary Gabor Transform
by: Akaishi, Natsuki, et al.
Published: (2026)

AuditoryBench++: Can Language Models Understand Auditory Knowledge without Hearing?
by: Ok, Hyunjong, et al.
Published: (2025)

Continuous warped time-frequency representations - Coorbit spaces and discretization
by: Holighaus, Nicki, et al.
Published: (2015)

Differentiable Attenuation Filters for Feedback Delay Networks
by: Ibnyahya, Ilias, et al.
Published: (2025)

OASI: Objective-Aware Surrogate Initialization for Multi-Objective Bayesian Optimization in TinyML Keyword Spotting
by: Garai, Soumen, et al.
Published: (2025)

Papez: Resource-Efficient Speech Separation with Auditory Working Memory
by: Oh, Hyunseok, et al.
Published: (2024)

AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing
by: Adeli, Behtom, et al.
Published: (2025)

A Convolutional Framework for Mapping Imagined Auditory MEG into Listened Brain Responses
by: Maghsoudi, Maryam, et al.
Published: (2025)

DARNet: Dual Attention Refinement Network with Spatiotemporal Construction for Auditory Attention Detection
by: Yan, Sheng, et al.
Published: (2024)

Smoothness spaces for warped time-frequency representations -- Decomposition spaces and embedding relations
by: Holighaus, Nicki, et al.
Published: (2024)

Coorbit theory of warped time-frequency systems in $\mathbb{R}^d$
by: Holighaus, Nicki, et al.
Published: (2022)

A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments
by: Khondkar, Md Jahangir Alam, et al.
Published: (2025)

TinySV: Speaker Verification in TinyML with On-device Learning
by: Pavan, Massimo, et al.
Published: (2024)

Understanding Auditory Evoked Brain Signal via Physics-informed Embedding Network with Multi-Task Transformer
by: Ma, Wanli, et al.
Published: (2024)

WavInWav: Time-domain Speech Hiding via Invertible Neural Network
by: Fan, Wei, et al.
Published: (2025)

Improving Underwater Acoustic Classification Through Learnable Gabor Filter Convolution and Attention Mechanisms
by: Domingos, Lucas Cesar Ferreira, et al.
Published: (2025)

Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models
by: Yoo, Suho, et al.
Published: (2025)

Decoding Selective Auditory Attention to Musical Elements in Ecologically Valid Music Listening
by: Akama, Taketo, et al.
Published: (2025)

What Do Language Models Hear? Probing for Auditory Representations in Language Models
by: Ngo, Jerry, et al.
Published: (2024)

Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
by: Segal-Feldman, Yael, et al.
Published: (2025)

Differentiable All-pole Filters for Time-varying Audio Systems
by: Yu, Chin-Yun, et al.
Published: (2024)

chatter: a Python library for applying information theory and AI/ML models to animal communication
by: Youngblood, Mason
Published: (2025)

Edge Intelligence for Wildlife Conservation: Real-Time Hornbill Call Classification Using TinyML
by: Hing, Kong Ka, et al.
Published: (2025)

AADNet: Exploring EEG Spatiotemporal Information for Fast and Accurate Orientation and Timbre Detection of Auditory Attention Based on A Cue-Masked Paradigm
by: Shi, Keren, et al.
Published: (2025)

Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods
by: Park, Siwoo
Published: (2025)

ADNAC: Audio Denoiser using Neural Audio Codec
by: Jimon, Daniel, et al.
Published: (2025)

Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
by: Sun, Yifan, et al.
Published: (2025)

Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters
by: Tesch, Kristina, et al.
Published: (2023)

MK-SGC-SC: Multiple Kernel Guided Sparse Graph Construction in Spectral Clustering for Unsupervised Speaker Diarization
by: Raghav, Nikhil, et al.
Published: (2026)

TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors
by: Huang, Zhaolan, et al.
Published: (2024)

Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification
by: Milling, Manuel, et al.
Published: (2023)

Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers
by: Kienegger, Jakob, et al.
Published: (2026)

Steering Deep Non-Linear Spatially Selective Filters for Weakly Guided Extraction of Moving Speakers in Dynamic Scenarios
by: Kienegger, Jakob, et al.
Published: (2025)

Meta-Learning-Based Delayless Subband Adaptive Filter using Complex Self-Attention for Active Noise Control
by: Feng, Pengxing, et al.
Published: (2024)

Safeguarding Privacy in Edge Speech Understanding with Tiny Foundation Models
by: Benazir, Afsara, et al.
Published: (2025)