Saved in:
| Main Authors: | Haider, Daniel, Perfler, Felix, Balazs, Peter, Hollomey, Clara, Holighaus, Nicki |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.07709 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
by: Haider, Daniel, et al.
Published: (2024)
by: Haider, Daniel, et al.
Published: (2024)
Phase-Based Signal Representations for Scattering
by: Haider, Daniel, et al.
Published: (2022)
by: Haider, Daniel, et al.
Published: (2022)
Aliasing in Convnets: A Frame-Theoretic Perspective
by: Haider, Daniel, et al.
Published: (2025)
by: Haider, Daniel, et al.
Published: (2025)
Fitting Auditory Filterbanks with Multiresolution Neural Networks
by: Lostanlen, Vincent, et al.
Published: (2023)
by: Lostanlen, Vincent, et al.
Published: (2023)
Instabilities in Convnets for Raw Audio
by: Haider, Daniel, et al.
Published: (2023)
by: Haider, Daniel, et al.
Published: (2023)
SELEBI: Percussion-aware Time Stretching via Selective Magnitude Spectrogram Compression by Nonstationary Gabor Transform
by: Akaishi, Natsuki, et al.
Published: (2026)
by: Akaishi, Natsuki, et al.
Published: (2026)
AuditoryBench++: Can Language Models Understand Auditory Knowledge without Hearing?
by: Ok, Hyunjong, et al.
Published: (2025)
by: Ok, Hyunjong, et al.
Published: (2025)
Continuous warped time-frequency representations - Coorbit spaces and discretization
by: Holighaus, Nicki, et al.
Published: (2015)
by: Holighaus, Nicki, et al.
Published: (2015)
Differentiable Attenuation Filters for Feedback Delay Networks
by: Ibnyahya, Ilias, et al.
Published: (2025)
by: Ibnyahya, Ilias, et al.
Published: (2025)
OASI: Objective-Aware Surrogate Initialization for Multi-Objective Bayesian Optimization in TinyML Keyword Spotting
by: Garai, Soumen, et al.
Published: (2025)
by: Garai, Soumen, et al.
Published: (2025)
Papez: Resource-Efficient Speech Separation with Auditory Working Memory
by: Oh, Hyunseok, et al.
Published: (2024)
by: Oh, Hyunseok, et al.
Published: (2024)
AbsoluteNet: A Deep Learning Neural Network to Classify Cerebral Hemodynamic Responses of Auditory Processing
by: Adeli, Behtom, et al.
Published: (2025)
by: Adeli, Behtom, et al.
Published: (2025)
A Convolutional Framework for Mapping Imagined Auditory MEG into Listened Brain Responses
by: Maghsoudi, Maryam, et al.
Published: (2025)
by: Maghsoudi, Maryam, et al.
Published: (2025)
DARNet: Dual Attention Refinement Network with Spatiotemporal Construction for Auditory Attention Detection
by: Yan, Sheng, et al.
Published: (2024)
by: Yan, Sheng, et al.
Published: (2024)
Smoothness spaces for warped time-frequency representations -- Decomposition spaces and embedding relations
by: Holighaus, Nicki, et al.
Published: (2024)
by: Holighaus, Nicki, et al.
Published: (2024)
Coorbit theory of warped time-frequency systems in $\mathbb{R}^d$
by: Holighaus, Nicki, et al.
Published: (2022)
by: Holighaus, Nicki, et al.
Published: (2022)
A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments
by: Khondkar, Md Jahangir Alam, et al.
Published: (2025)
by: Khondkar, Md Jahangir Alam, et al.
Published: (2025)
TinySV: Speaker Verification in TinyML with On-device Learning
by: Pavan, Massimo, et al.
Published: (2024)
by: Pavan, Massimo, et al.
Published: (2024)
Understanding Auditory Evoked Brain Signal via Physics-informed Embedding Network with Multi-Task Transformer
by: Ma, Wanli, et al.
Published: (2024)
by: Ma, Wanli, et al.
Published: (2024)
WavInWav: Time-domain Speech Hiding via Invertible Neural Network
by: Fan, Wei, et al.
Published: (2025)
by: Fan, Wei, et al.
Published: (2025)
Improving Underwater Acoustic Classification Through Learnable Gabor Filter Convolution and Attention Mechanisms
by: Domingos, Lucas Cesar Ferreira, et al.
Published: (2025)
by: Domingos, Lucas Cesar Ferreira, et al.
Published: (2025)
Imagine to Hear: Auditory Knowledge Generation can be an Effective Assistant for Language Models
by: Yoo, Suho, et al.
Published: (2025)
by: Yoo, Suho, et al.
Published: (2025)
Decoding Selective Auditory Attention to Musical Elements in Ecologically Valid Music Listening
by: Akama, Taketo, et al.
Published: (2025)
by: Akama, Taketo, et al.
Published: (2025)
What Do Language Models Hear? Probing for Auditory Representations in Language Models
by: Ngo, Jerry, et al.
Published: (2024)
by: Ngo, Jerry, et al.
Published: (2024)
Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
by: Segal-Feldman, Yael, et al.
Published: (2025)
by: Segal-Feldman, Yael, et al.
Published: (2025)
Differentiable All-pole Filters for Time-varying Audio Systems
by: Yu, Chin-Yun, et al.
Published: (2024)
by: Yu, Chin-Yun, et al.
Published: (2024)
chatter: a Python library for applying information theory and AI/ML models to animal communication
by: Youngblood, Mason
Published: (2025)
by: Youngblood, Mason
Published: (2025)
Edge Intelligence for Wildlife Conservation: Real-Time Hornbill Call Classification Using TinyML
by: Hing, Kong Ka, et al.
Published: (2025)
by: Hing, Kong Ka, et al.
Published: (2025)
AADNet: Exploring EEG Spatiotemporal Information for Fast and Accurate Orientation and Timbre Detection of Auditory Attention Based on A Cue-Masked Paradigm
by: Shi, Keren, et al.
Published: (2025)
by: Shi, Keren, et al.
Published: (2025)
Investigating the Invertibility of Multimodal Latent Spaces: Limitations of Optimization-Based Methods
by: Park, Siwoo
Published: (2025)
by: Park, Siwoo
Published: (2025)
ADNAC: Audio Denoiser using Neural Audio Codec
by: Jimon, Daniel, et al.
Published: (2025)
by: Jimon, Daniel, et al.
Published: (2025)
Hankel-FNO: Fast Underwater Acoustic Charting Via Physics-Encoded Fourier Neural Operator
by: Sun, Yifan, et al.
Published: (2025)
by: Sun, Yifan, et al.
Published: (2025)
Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters
by: Tesch, Kristina, et al.
Published: (2023)
by: Tesch, Kristina, et al.
Published: (2023)
MK-SGC-SC: Multiple Kernel Guided Sparse Graph Construction in Spectral Clustering for Unsupervised Speaker Diarization
by: Raghav, Nikhil, et al.
Published: (2026)
by: Raghav, Nikhil, et al.
Published: (2026)
TinyChirp: Bird Song Recognition Using TinyML Models on Low-power Wireless Acoustic Sensors
by: Huang, Zhaolan, et al.
Published: (2024)
by: Huang, Zhaolan, et al.
Published: (2024)
Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification
by: Milling, Manuel, et al.
Published: (2023)
by: Milling, Manuel, et al.
Published: (2023)
Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers
by: Kienegger, Jakob, et al.
Published: (2026)
by: Kienegger, Jakob, et al.
Published: (2026)
Steering Deep Non-Linear Spatially Selective Filters for Weakly Guided Extraction of Moving Speakers in Dynamic Scenarios
by: Kienegger, Jakob, et al.
Published: (2025)
by: Kienegger, Jakob, et al.
Published: (2025)
Meta-Learning-Based Delayless Subband Adaptive Filter using Complex Self-Attention for Active Noise Control
by: Feng, Pengxing, et al.
Published: (2024)
by: Feng, Pengxing, et al.
Published: (2024)
Safeguarding Privacy in Edge Speech Understanding with Tiny Foundation Models
by: Benazir, Afsara, et al.
Published: (2025)
by: Benazir, Afsara, et al.
Published: (2025)
Similar Items
-
Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
by: Haider, Daniel, et al.
Published: (2024) -
Phase-Based Signal Representations for Scattering
by: Haider, Daniel, et al.
Published: (2022) -
Aliasing in Convnets: A Frame-Theoretic Perspective
by: Haider, Daniel, et al.
Published: (2025) -
Fitting Auditory Filterbanks with Multiresolution Neural Networks
by: Lostanlen, Vincent, et al.
Published: (2023) -
Instabilities in Convnets for Raw Audio
by: Haider, Daniel, et al.
Published: (2023)