| Main Authors: | Yadav, Manuj; Kim, Jungsoo; Hongisto, Valtteri; Cabrera, Densil; de Dear, Richard |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.15744 |
Similar Items
Real-time auralization for performers on virtual stages
by: Accolti, Ernesto, et al.
Published: (2023)
De-crackling Virtual Analog Controls with Asymptotically Stable Recurrent Neural Networks
by: Kallinen, Valtteri, et al.
Published: (2025)
Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
by: Ermert, Cosima A., et al.
Published: (2024)
Adversarial speech for voice privacy protection from Personalized Speech generation
by: Chen, Shihao, et al.
Published: (2024)
Physics-informed neural network for acoustic resonance analysis in a one-dimensional acoustic tube
by: Yokota, Kazuya, et al.
Published: (2023)
Computationally-efficient and perceptually-motivated rendering of diffuse reflections in room acoustics simulation
by: Ewert, Stephan D., et al.
Published: (2023)
Analysing the Masked predictive coding training criterion for pre-training a Speech Representation Model
by: Yadav, Hemant, et al.
Published: (2023)
NAST: Noise Aware Speech Tokenization for Speech Language Models
by: Messica, Shoval, et al.
Published: (2024)
Communication conditions in virtual acoustic scenes in an underground station
by: Hládek, Ľuboš, et al.
Published: (2021)
Robust DOA estimation using deep acoustic imaging
by: Roman, Adrian S., et al.
Published: (2024)
An interpretable speech foundation model for depression detection by revealing prediction-relevant acoustic features from long speech
by: Deng, Qingkun, et al.
Published: (2024)
A toolbox for rendering virtual acoustic environments in the context of audiology
by: Grimm, Giso, et al.
Published: (2018)
Guiding the underwater acoustic target recognition with interpretable contrastive learning
by: Xie, Yuan, et al.
Published: (2024)
Efficient Extraction of Noise-Robust Discrete Units from Self-Supervised Speech Models
by: Poncelet, Jakob, et al.
Published: (2024)
On the relevance of acoustic measurements for creating realistic virtual acoustic environments
by: Gündert, Siegfried, et al.
Published: (2023)
Cascaded noise reduction and acoustic echo cancellation based on an extended noise reduction
by: Roebben, Arnout, et al.
Published: (2024)
Investigating differences in lab-quality and remote recording methods with dynamic acoustic measures
by: Zhang, Cong, et al.
Published: (2024)
Deep, data-driven modeling of room acoustics: literature review and research perspectives
by: van Waterschoot, Toon
Published: (2025)
A state-space representation of the boundary integral equation for room acoustic modelling
by: Ali, Randall, et al.
Published: (2026)
Noise-Aware Speech Separation with Contrastive Learning
by: Zhang, Zizheng, et al.
Published: (2023)
A circular microphone array with virtual microphones based on acoustics-informed neural networks
by: Zhao, Sipei, et al.
Published: (2024)
AxLSTMs: learning self-supervised audio representations with xLSTMs
by: Yadav, Sarthak, et al.
Published: (2024)
Temporal Pooling Strategies for Training-Free Anomalous Sound Detection with Self-Supervised Audio Embeddings
by: Wilkinghoff, Kevin, et al.
Published: (2026)
Automatic acoustic detection of birds through deep learning: the first Bird Audio Detection challenge
by: Stowell, Dan, et al.
Published: (2018)
A Neural Speech Codec for Noise Robust Speech Coding
by: Huang, Jiayi, et al.
Published: (2023)
NTC-KWS: Noise-aware CTC for Robust Keyword Spotting
by: Xi, Yu, et al.
Published: (2024)
DGSNA: Dynamic Generative Scene-based Noise Addition method
by: Chen, Zihao, et al.
Published: (2024)
Transient Noise Removal via Diffusion-based Speech Inpainting
by: Moradi, Mordehay, et al.
Published: (2025)
InsectSet459: an open dataset of insect sounds for bioacoustic machine learning
by: Faiß, Marius, et al.
Published: (2025)
Theory and investigation of acoustic multiple-input multiple-output systems based on spherical arrays in a room
by: Morgenstern, Hai, et al.
Published: (2024)
Gradient weighting for speaker verification in extremely low Signal-to-Noise Ratio
by: Ma, Yi, et al.
Published: (2024)
VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion
by: Byun, Kyungguen, et al.
Published: (2024)
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection
by: Fan, Cunhang, et al.
Published: (2023)
Noisy Disentanglement with Tri-stage Training for Noise-Robust Speech Recognition
by: Chen, Shuangyuan, et al.
Published: (2025)
Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule
by: Wang, Siyi, et al.
Published: (2024)
Suppressing Noise Disparity in Training Data for Automatic Pathological Speech Detection
by: Amiri, Mahdi, et al.
Published: (2024)
E2E-AEC: Implementing an end-to-end neural network learning approach for acoustic echo cancellation
by: Jiang, Yiheng, et al.
Published: (2026)
Real-time multichannel deep speech enhancement in hearing aids: Comparing monaural and binaural processing in complex acoustic scenarios
by: Westhausen, Nils L., et al.
Published: (2024)
Noise-to-mask Ratio Loss for Deep Neural Network based Audio Watermarking
by: Moritz, Martin, et al.
Published: (2024)
Towards Bitrate-Efficient and Noise-Robust Speech Coding with Variable Bitrate RVQ
by: Chae, Yunkee, et al.
Published: (2025)