| Main Authors: | Rong, Xiaobin, Wang, Yushi, Wang, Zheng, Lu, Jing |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2604.01832 |
Similar Items
TS-URGENet: A Three-stage Universal Robust and Generalizable Speech Enhancement Network
by: Rong, Xiaobin, et al.
Published: (2025)
Adaptive Convolution for CNN-based Speech Enhancement Models
by: Wang, Dahan, et al.
Published: (2025)
UniPASE: A Generative Model for Universal Speech Enhancement with High Fidelity and Low Hallucinations
by: Rong, Xiaobin, et al.
Published: (2026)
Universal Speech Enhancement with Regression and Generative Mamba
by: Chao, Rong, et al.
Published: (2025)
A Hybrid Discriminative and Generative System for Universal Speech Enhancement
by: Liu, Yinghao, et al.
Published: (2026)
Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network
by: Chen, Yanan, et al.
Published: (2024)
Speech Emotion Recognition Using Fine-Tuned DWFormer:A Study on Track 1 of the IERPChallenge 2024
by: Wang, Honghong, et al.
Published: (2025)
FUSE: Universal Speech Enhancement using Multi-Stage Fusion of Sparse Compression and Token Generation Models for the URGENT 2025 Challenge
by: Goswami, Nabarun, et al.
Published: (2025)
Low-latency Speech Enhancement via Speech Token Generation
by: Xue, Huaying, et al.
Published: (2023)
Leveraging LLM for Stuttering Speech: A Unified Architecture Bridging Recognition and Event Detection
by: Huang, Shangkun, et al.
Published: (2025)
Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention
by: Zhang, Yuewei, et al.
Published: (2025)
Toward Universal Speech Enhancement for Diverse Input Conditions
by: Zhang, Wangyou, et al.
Published: (2023)
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement
by: Zhang, Wangyou, et al.
Published: (2024)
SNR-Progressive Model with Harmonic Compensation for Low-SNR Speech Enhancement
by: Hou, Zhongshu, et al.
Published: (2024)
Speech Enhancement with Dual-path Multi-Channel Linear Prediction Filter and Multi-norm Beamforming
by: Qin, Chengyuan, et al.
Published: (2025)
A Lightweight Fourier-based Network for Binaural Speech Enhancement with Spatial Cue Preservation
by: Lu, Xikun, et al.
Published: (2025)
Universal Score-based Speech Enhancement with High Content Preservation
by: Scheibler, Robin, et al.
Published: (2024)
EffiFusion-GAN: Efficient Fusion Generative Adversarial Network for Speech Enhancement
by: Wen, Bin, et al.
Published: (2025)
Combined Generative and Predictive Modeling for Speech Super-resolution
by: Wang, Heming, et al.
Published: (2024)
Geneses: Unified Generative Speech Enhancement and Separation
by: Asai, Kohei, et al.
Published: (2026)
Investigating Training Objectives for Generative Speech Enhancement
by: Richter, Julius, et al.
Published: (2024)
Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing
by: Ren, Wenze, et al.
Published: (2024)
Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
by: Sun, Xingwei, et al.
Published: (2025)
Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement
by: Ren, Wenze, et al.
Published: (2024)
LORT: Locally Refined Convolution and Taylor Transformer for Monaural Speech Enhancement
by: Wang, Junyu, et al.
Published: (2025)
Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement
by: Yuan, Xihao, et al.
Published: (2025)
MeanFlowSE: One-Step Generative Speech Enhancement via MeanFlow
by: Zhu, Yike, et al.
Published: (2025)
ICASSP 2026 URGENT Speech Enhancement Challenge
by: Li, Chenda, et al.
Published: (2026)
Few-step Adversarial Schrödinger Bridge for Generative Speech Enhancement
by: Han, Seungu, et al.
Published: (2025)
WTFormer: A Wavelet Conformer Network for MIMO Speech Enhancement with Spatial Cues Peservation
by: Han, Lu, et al.
Published: (2025)
A Two-Stage Framework in Cross-Spectrum Domain for Real-Time Speech Enhancement
by: Zhang, Yuewei, et al.
Published: (2024)
Leveraging Mamba with Full-Face Vision for Audio-Visual Speech Enhancement
by: Chao, Rong, et al.
Published: (2025)
Confidence-based Filtering for Speech Dataset Curation with Generative Speech Enhancement Using Discrete Tokens
by: Yamauchi, Kazuki, et al.
Published: (2026)
A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement
by: Lu, Shenghui, et al.
Published: (2025)
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement
by: Li, Chenda, et al.
Published: (2024)
Dual-View Predictive Diffusion: Lightweight Speech Enhancement via Spectrogram-Image Synergy
by: Xue, Ke, et al.
Published: (2026)
Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features
by: Tsunoo, Emiru, et al.
Published: (2024)
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
by: Lu, Ye-Xin, et al.
Published: (2023)
GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement
by: Wang, Chengzhong, et al.
Published: (2024)
Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement
by: Nasretdinov, Rauf, et al.
Published: (2025)