:: Library Catalog

Bibliographic Details
Main Authors:	Yadav, Manuj; Kim, Jungsoo; Hongisto, Valtteri; Cabrera, Densil; de Dear, Richard
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing; Sound
Online Access:	https://arxiv.org/abs/2501.15744

Similar Items

Real-time auralization for performers on virtual stages
by: Accolti, Ernesto, et al.
Published: (2023)

De-crackling Virtual Analog Controls with Asymptotically Stable Recurrent Neural Networks
by: Kallinen, Valtteri, et al.
Published: (2025)

Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
by: Ermert, Cosima A., et al.
Published: (2024)

Adversarial speech for voice privacy protection from Personalized Speech generation
by: Chen, Shihao, et al.
Published: (2024)

Physics-informed neural network for acoustic resonance analysis in a one-dimensional acoustic tube
by: Yokota, Kazuya, et al.
Published: (2023)

Computationally-efficient and perceptually-motivated rendering of diffuse reflections in room acoustics simulation
by: Ewert, Stephan D., et al.
Published: (2023)

Analysing the Masked predictive coding training criterion for pre-training a Speech Representation Model
by: Yadav, Hemant, et al.
Published: (2023)

NAST: Noise Aware Speech Tokenization for Speech Language Models
by: Messica, Shoval, et al.
Published: (2024)

Communication conditions in virtual acoustic scenes in an underground station
by: Hládek, Ľuboš, et al.
Published: (2021)

Robust DOA estimation using deep acoustic imaging
by: Roman, Adrian S., et al.
Published: (2024)

An interpretable speech foundation model for depression detection by revealing prediction-relevant acoustic features from long speech
by: Deng, Qingkun, et al.
Published: (2024)

A toolbox for rendering virtual acoustic environments in the context of audiology
by: Grimm, Giso, et al.
Published: (2018)

Guiding the underwater acoustic target recognition with interpretable contrastive learning
by: Xie, Yuan, et al.
Published: (2024)

Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models
by: Poncelet, Jakob, et al.
Published: (2024)

On the relevance of acoustic measurements for creating realistic virtual acoustic environments
by: Gündert, Siegfried, et al.
Published: (2023)

Cascaded noise reduction and acoustic echo cancellation based on an extended noise reduction
by: Roebben, Arnout, et al.
Published: (2024)

Investigating differences in lab-quality and remote recording methods with dynamic acoustic measures
by: Zhang, Cong, et al.
Published: (2024)

Deep, data-driven modeling of room acoustics: literature review and research perspectives
by: van Waterschoot, Toon
Published: (2025)

A state-space representation of the boundary integral equation for room acoustic modelling
by: Ali, Randall, et al.
Published: (2026)

Noise-Aware Speech Separation with Contrastive Learning
by: Zhang, Zizheng, et al.
Published: (2023)

A circular microphone array with virtual microphones based on acoustics-informed neural networks
by: Zhao, Sipei, et al.
Published: (2024)

AxLSTMs: learning self-supervised audio representations with xLSTMs
by: Yadav, Sarthak, et al.
Published: (2024)

Temporal Pooling Strategies for Training-Free Anomalous Sound Detection with Self-Supervised Audio Embeddings
by: Wilkinghoff, Kevin, et al.
Published: (2026)

Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge
by: Stowell, Dan, et al.
Published: (2018)

A Neural Speech Codec for Noise Robust Speech Coding
by: Huang, Jiayi, et al.
Published: (2023)

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting
by: Xi, Yu, et al.
Published: (2024)

DGSNA: Dynamic Generative Scene-based Noise Addition method
by: Chen, Zihao, et al.
Published: (2024)

Transient Noise Removal via Diffusion-based Speech Inpainting
by: Moradi, Mordehay, et al.
Published: (2025)

InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
by: Faiß, Marius, et al.
Published: (2025)

Theory and investigation of acoustic multiple-input multiple-output systems based on spherical arrays in a room
by: Morgenstern, Hai, et al.
Published: (2024)

Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio
by: Ma, Yi, et al.
Published: (2024)

VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion
by: Byun, Kyungguen, et al.
Published: (2024)

Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection
by: Fan, Cunhang, et al.
Published: (2023)

Noisy Disentanglement with Tri-stage Training for Noise-Robust Speech Recognition
by: Chen, Shuangyuan, et al.
Published: (2025)

Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule
by: Wang, Siyi, et al.
Published: (2024)

Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection
by: Amiri, Mahdi, et al.
Published: (2024)

E2E-AEC: Implementing an end-to-end neural network learning approach for acoustic echo cancellation
by: Jiang, Yiheng, et al.
Published: (2026)

Real-time multichannel deep speech enhancement in hearing aids: Comparing monaural and binaural processing in complex acoustic scenarios
by: Westhausen, Nils L., et al.
Published: (2024)

Noise-to-mask Ratio Loss for Deep Neural Network based Audio Watermarking
by: Moritz, Martin, et al.
Published: (2024)

Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ
by: Chae, Yunkee, et al.
Published: (2025)