:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Togami, Masahito, Valin, Jean-Marc, Helwani, Karim, Giri, Ritwik, Isik, Umut, Goodwin, Michael M.
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2402.00337
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping
by: Büthe, Jan, et al.
Published: (2023)

Sound Source Separation Using Latent Variational Block-Wise Disentanglement
by: Helwani, Karim, et al.
Published: (2024)

RADE: A Neural Codec for Transmitting Speech over HF Radio Channels
by: Rowe, David, et al.
Published: (2025)

A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation
by: Wang, Jingyuan, et al.
Published: (2024)

Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction
by: Valin, Jean-Marc, et al.
Published: (2024)

A Lightweight Fourier-based Network for Binaural Speech Enhancement with Spatial Cue Preservation
by: Lu, Xikun, et al.
Published: (2025)

Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity
by: Subramani, Krishna, et al.
Published: (2023)

DRED: Deep REDundancy Coding of Speech Using a Rate-Distortion-Optimized Variational Autoencoder
by: Valin, Jean-Marc, et al.
Published: (2022)

A lightweight and robust method for blind wideband-to-fullband extension of speech
by: Büthe, Jan, et al.
Published: (2024)

WTFormer: A Wavelet Conformer Network for MIMO Speech Enhancement with Spatial Cues Peservation
by: Han, Lu, et al.
Published: (2025)

LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech Enhancement
by: Yan, Haoyin, et al.
Published: (2024)

ZipEnhancer: Dual-Path Down-Up Sampling-based Zipformer for Monaural Speech Enhancement
by: Wang, Haoxu, et al.
Published: (2025)

Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues
by: Hussain, Tassadaq, et al.
Published: (2024)

DAT-CFTNet: Speech Enhancement for Cochlear Implant Recipients using Attention-based Dual-Path Recurrent Neural Network
by: Mamun, Nursadul, et al.
Published: (2026)

Audio-Visual Speech Enhancement for Spatial Audio - Spatial-VisualVoice and the MAVE Database
by: Yaffe, Danielle, et al.
Published: (2025)

Direction-Preserving MIMO Speech Enhancement Using a Neural Covariance Estimator
by: Deppisch, Thomas
Published: (2026)

Universal Score-based Speech Enhancement with High Content Preservation
by: Scheibler, Robin, et al.
Published: (2024)

Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement
by: Zheng, Tianqin, et al.
Published: (2025)

An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement
by: Ku, Pin-Jui, et al.
Published: (2024)

Leveraging Spatial Cues from Cochlear Implant Microphones to Efficiently Enhance Speech Separation in Real-World Listening Scenes
by: Olalere, Feyisayo, et al.
Published: (2025)

Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions
by: La Quatra, Moreno, et al.
Published: (2024)

A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition
by: Guo, Zilu, et al.
Published: (2024)

Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
by: Pandey, Ashutosh, et al.
Published: (2024)

Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
by: Ren, Wenze, et al.
Published: (2024)

A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions
by: Wang, Zheng, et al.
Published: (2025)

DroFiT: A Lightweight Band-fused Frequency Attention Toward Real-time UAV Speech Enhancement
by: Lee, Jeongmin, et al.
Published: (2025)

Influence of Clean Speech Characteristics on Speech Enhancement Performance
by: Hou, Mingchi, et al.
Published: (2025)

Exploring Efficient Directional and Distance Cues for Regional Speech Separation
by: Jiang, Yiheng, et al.
Published: (2025)

Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
by: Ochiai, Tsubasa, et al.
Published: (2024)

Exploiting Consistency-Preserving Loss and Perceptual Contrast Stretching to Boost SSL-based Speech Enhancement
by: Khan, Muhammad Salman, et al.
Published: (2024)

Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario
by: Wen, Wen, et al.
Published: (2024)

Conditional Latent Diffusion-Based Speech Enhancement Via Dual Context Learning
by: Zhao, Shengkui, et al.
Published: (2025)

A Dual-Branch Parallel Network for Speech Enhancement and Restoration
by: Yang, Da-Hee, et al.
Published: (2024)

Inter-Speaker Relative Cues for Two-Stage Text-Guided Target Speech Extraction
by: Dai, Wang, et al.
Published: (2026)

A Semi-spontaneous Dutch Speech Dataset for Speech Enhancement and Speech Recognition
by: de Groot, Dimme, et al.
Published: (2026)

Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization
by: Yang, Bing, et al.
Published: (2024)

Hybrid Real- And Complex-Valued Neural Network Concept For Low-Complexity Phase-Aware Speech Enhancement
by: Fiorio, Luan Vinícius, et al.
Published: (2025)

Data Augmentation for Pathological Speech Enhancement
by: Hou, Mingchi, et al.
Published: (2026)

Schrödinger Bridge for Generative Speech Enhancement
by: Jukić, Ante, et al.
Published: (2024)