Saved in:
| Main Authors: | Yu, Guochen, Han, Runqiang, Xu, Chenglin, Zhao, Haoran, Li, Nan, Zhang, Chen, Zheng, Xiguang, Zhou, Chao, Huang, Qi, Yu, Bing |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.01808 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Detecting gamma-band responses to the speech envelope for the ICASSP 2024 Auditory EEG Decoding Signal Processing Grand Challenge
by: Thornton, Mike, et al.
Published: (2024)
by: Thornton, Mike, et al.
Published: (2024)
The ICASSP 2024 Audio Deep Packet Loss Concealment Challenge
by: Diener, Lorenz, et al.
Published: (2024)
by: Diener, Lorenz, et al.
Published: (2024)
ICASSP 2026 URGENT Speech Enhancement Challenge
by: Li, Chenda, et al.
Published: (2026)
by: Li, Chenda, et al.
Published: (2026)
The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge
by: Ma, Guobin, et al.
Published: (2026)
by: Ma, Guobin, et al.
Published: (2026)
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
by: Wang, He, et al.
Published: (2024)
by: Wang, He, et al.
Published: (2024)
ICASSP 2024 Speech Signal Improvement Challenge
by: Ristea, Nicolae Catalin, et al.
Published: (2024)
by: Ristea, Nicolae Catalin, et al.
Published: (2024)
PhoenixCodec: Taming Neural Speech Coding for Extreme Low-Resource Scenarios
by: Wan, Zixiang, et al.
Published: (2025)
by: Wan, Zixiang, et al.
Published: (2025)
Inter-channel Conv-TasNet for multichannel speech enhancement
by: Lee, Dongheon, et al.
Published: (2021)
by: Lee, Dongheon, et al.
Published: (2021)
Deep low-latency joint speech transmission and enhancement over a gaussian channel
by: Bokaei, Mohammad, et al.
Published: (2024)
by: Bokaei, Mohammad, et al.
Published: (2024)
RaD-Net 2: A causal two-stage repairing and denoising speech enhancement network with knowledge distillation and complex axial self-attention
by: Liu, Mingshuai, et al.
Published: (2024)
by: Liu, Mingshuai, et al.
Published: (2024)
A lightweight dual-stage framework for personalized speech enhancement based on DeepFilterNet2
by: Serre, Thomas, et al.
Published: (2024)
by: Serre, Thomas, et al.
Published: (2024)
SPGM: Prioritizing Local Features for enhanced speech separation performance
by: Yip, Jia Qi, et al.
Published: (2023)
by: Yip, Jia Qi, et al.
Published: (2023)
Transferable speech-to-text large language model alignment module
by: Wu, Boyong, et al.
Published: (2024)
by: Wu, Boyong, et al.
Published: (2024)
An adaptive filter bank based neural network approach for time delay estimation and speech enhancement
by: Ma, Lu
Published: (2025)
by: Ma, Lu
Published: (2025)
Music Enhancement with Deep Filters: A Technical Report for The ICASSP 2024 Cadenza Challenge
by: Shao, Keren, et al.
Published: (2024)
by: Shao, Keren, et al.
Published: (2024)
Towards noise-robust speech inversion through multi-task learning with speech enhancement
by: Tabatabaee, Saba, et al.
Published: (2026)
by: Tabatabaee, Saba, et al.
Published: (2026)
TTS-CtrlNet: Time varying emotion aligned text-to-speech generation with ControlNet
by: Jeong, Jaeseok, et al.
Published: (2025)
by: Jeong, Jaeseok, et al.
Published: (2025)
Unsupervised speech enhancement with spectral kurtosis and double deep priors
by: Ohnaka, Hien, et al.
Published: (2024)
by: Ohnaka, Hien, et al.
Published: (2024)
GDiffuSE: Diffusion-based speech enhancement with noise model guidance
by: Yanir, Efrayim, et al.
Published: (2025)
by: Yanir, Efrayim, et al.
Published: (2025)
Monaural speech enhancement on drone via Adapter based transfer learning
by: Chen, Xingyu, et al.
Published: (2024)
by: Chen, Xingyu, et al.
Published: (2024)
Determined blind source separation via modeling adjacent frequency band correlations in speech signals
by: Wang, Jianyu, et al.
Published: (2025)
by: Wang, Jianyu, et al.
Published: (2025)
Modeling strategies for speech enhancement in the latent space of a neural audio codec
by: Kammoun, Sofiene, et al.
Published: (2025)
by: Kammoun, Sofiene, et al.
Published: (2025)
Using RLHF to align speech enhancement approaches to mean-opinion quality scores
by: Kumar, Anurag, et al.
Published: (2024)
by: Kumar, Anurag, et al.
Published: (2024)
Single-channel speech enhancement by using psychoacoustical model inspired fusion framework
by: Samui, Suman
Published: (2022)
by: Samui, Suman
Published: (2022)
A quest through interconnected datasets: lessons from highly-cited ICASSP papers
by: Liem, Cynthia C. S., et al.
Published: (2024)
by: Liem, Cynthia C. S., et al.
Published: (2024)
Enhancement by postfiltering for speech and audio coding in ad-hoc sensor networks
by: Das, Sneha, et al.
Published: (2020)
by: Das, Sneha, et al.
Published: (2020)
Graph-based multi-Feature fusion method for speech emotion recognition
by: Liu, Xueyu, et al.
Published: (2024)
by: Liu, Xueyu, et al.
Published: (2024)
The CHiME-7 UDASE task: Unsupervised domain adaptation for conversational speech enhancement
by: Leglaive, Simon, et al.
Published: (2023)
by: Leglaive, Simon, et al.
Published: (2023)
Configurable EBEN: Extreme Bandwidth Extension Network to enhance body-conducted speech capture
by: Hauret, Julien, et al.
Published: (2023)
by: Hauret, Julien, et al.
Published: (2023)
Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
by: Yin, Han, et al.
Published: (2024)
by: Yin, Han, et al.
Published: (2024)
Throat and acoustic paired speech dataset for deep learning-based speech enhancement
by: Kim, Yunsik, et al.
Published: (2025)
by: Kim, Yunsik, et al.
Published: (2025)
An automatic mixing speech enhancement system for multi-track audio
by: Liu, Xiaojing, et al.
Published: (2024)
by: Liu, Xiaojing, et al.
Published: (2024)
A two-step approach for speech enhancement in low-SNR scenarios using cyclostationary beamforming and DNNs
by: Bologni, Giovanni, et al.
Published: (2026)
by: Bologni, Giovanni, et al.
Published: (2026)
From the perspective of perceptual speech quality: The robustness of frequency bands to noise
by: Fan, Junyi, et al.
Published: (2025)
by: Fan, Junyi, et al.
Published: (2025)
The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction
by: Huang, Wen-Chin, et al.
Published: (2024)
by: Huang, Wen-Chin, et al.
Published: (2024)
Effects of automotive microphone frequency response characteristics and noise conditions on speech and ASR quality -- an experimental evaluation
by: Buccoli, Michele, et al.
Published: (2025)
by: Buccoli, Michele, et al.
Published: (2025)
Real-time multichannel deep speech enhancement in hearing aids: Comparing monaural and binaural processing in complex acoustic scenarios
by: Westhausen, Nils L., et al.
Published: (2024)
by: Westhausen, Nils L., et al.
Published: (2024)
On the relationship between speech and hearing
by: Umesh, Srinivasan, et al.
Published: (2024)
by: Umesh, Srinivasan, et al.
Published: (2024)
The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era
by: Zhao, Zhixian, et al.
Published: (2026)
by: Zhao, Zhixian, et al.
Published: (2026)
Robust fine-tuning of speech recognition models via model merging: application to disordered speech
by: Ducorroy, Alexandre, et al.
Published: (2025)
by: Ducorroy, Alexandre, et al.
Published: (2025)
Similar Items
-
Detecting gamma-band responses to the speech envelope for the ICASSP 2024 Auditory EEG Decoding Signal Processing Grand Challenge
by: Thornton, Mike, et al.
Published: (2024) -
The ICASSP 2024 Audio Deep Packet Loss Concealment Challenge
by: Diener, Lorenz, et al.
Published: (2024) -
ICASSP 2026 URGENT Speech Enhancement Challenge
by: Li, Chenda, et al.
Published: (2026) -
The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge
by: Ma, Guobin, et al.
Published: (2026) -
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge
by: Wang, He, et al.
Published: (2024)