Saved in:
| Main Authors: | Zhao, Haixin, Yang, Kaixuan, Madhu, Nilesh |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.11395 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Study of Lightweight Transformer Architectures for Single-Channel Speech Enhancement
by: Zhao, Haixin, et al.
Published: (2025)
by: Zhao, Haixin, et al.
Published: (2025)
Dynamic Slimmable Networks for Efficient Speech Separation
by: Elminshawi, Mohamed, et al.
Published: (2025)
by: Elminshawi, Mohamed, et al.
Published: (2025)
Enhanced Deep Speech Separation in Clustered Ad Hoc Distributed Microphone Environments
by: Kim, Jihyun, et al.
Published: (2024)
by: Kim, Jihyun, et al.
Published: (2024)
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
by: Huang, Ziling, et al.
Published: (2025)
by: Huang, Ziling, et al.
Published: (2025)
Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
by: Close, George, et al.
Published: (2024)
by: Close, George, et al.
Published: (2024)
SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation
by: Huang, Ziling, et al.
Published: (2025)
by: Huang, Ziling, et al.
Published: (2025)
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics
by: Wang, Syu-Siang, et al.
Published: (2024)
by: Wang, Syu-Siang, et al.
Published: (2024)
Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network
by: Chen, Yanan, et al.
Published: (2024)
by: Chen, Yanan, et al.
Published: (2024)
Tracking Listener Attention: Gaze-Guided Audio-Visual Speech Enhancement Framework
by: Yang, Hsiang-Cheng, et al.
Published: (2026)
by: Yang, Hsiang-Cheng, et al.
Published: (2026)
Investigating Training Objectives for Generative Speech Enhancement
by: Richter, Julius, et al.
Published: (2024)
by: Richter, Julius, et al.
Published: (2024)
FlowSE-GRPO: Training Flow Matching Speech Enhancement via Online Reinforcement Learning
by: Wang, Haoxu, et al.
Published: (2026)
by: Wang, Haoxu, et al.
Published: (2026)
Enroll-on-Wakeup: A First Comparative Study of Target Speech Extraction for Seamless Interaction in Real Noisy Human-Machine Dialogue Scenarios
by: Yang, Yiming, et al.
Published: (2026)
by: Yang, Yiming, et al.
Published: (2026)
Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment
by: Wang, Wei, et al.
Published: (2025)
by: Wang, Wei, et al.
Published: (2025)
The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems
by: Gonzalez, Philippe, et al.
Published: (2024)
by: Gonzalez, Philippe, et al.
Published: (2024)
PLDNet: PLD-Guided Lightweight Deep Network Boosted by Efficient Attention for Handheld Dual-Microphone Speech Enhancement
by: Zhou, Nan, et al.
Published: (2024)
by: Zhou, Nan, et al.
Published: (2024)
Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
by: Tokala, Vikas, et al.
Published: (2024)
by: Tokala, Vikas, et al.
Published: (2024)
Influence of Clean Speech Characteristics on Speech Enhancement Performance
by: Hou, Mingchi, et al.
Published: (2025)
by: Hou, Mingchi, et al.
Published: (2025)
Dynamic Frequency-Adaptive Knowledge Distillation for Speech Enhancement
by: Yuan, Xihao, et al.
Published: (2025)
by: Yuan, Xihao, et al.
Published: (2025)
WTFormer: A Wavelet Conformer Network for MIMO Speech Enhancement with Spatial Cues Peservation
by: Han, Lu, et al.
Published: (2025)
by: Han, Lu, et al.
Published: (2025)
GALD-SE: Guided Anisotropic Lightweight Diffusion for Efficient Speech Enhancement
by: Wang, Chengzhong, et al.
Published: (2024)
by: Wang, Chengzhong, et al.
Published: (2024)
Test-Time Adaptation For Speech Enhancement Via Mask Polarization
by: Raichle, Tobias, et al.
Published: (2026)
by: Raichle, Tobias, et al.
Published: (2026)
A Semi-spontaneous Dutch Speech Dataset for Speech Enhancement and Speech Recognition
by: de Groot, Dimme, et al.
Published: (2026)
by: de Groot, Dimme, et al.
Published: (2026)
Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)
by: Behringer, Lyonel, et al.
Published: (2026)
Long-Context Modeling Networks for Monaural Speech Enhancement: A Comparative Study
by: Zhang, Qiquan, et al.
Published: (2025)
by: Zhang, Qiquan, et al.
Published: (2025)
Binaural Speech Enhancement Using Complex Convolutional Recurrent Networks
by: Tokala, Vikas, et al.
Published: (2025)
by: Tokala, Vikas, et al.
Published: (2025)
Data Augmentation for Pathological Speech Enhancement
by: Hou, Mingchi, et al.
Published: (2026)
by: Hou, Mingchi, et al.
Published: (2026)
Schrödinger Bridge for Generative Speech Enhancement
by: Jukić, Ante, et al.
Published: (2024)
by: Jukić, Ante, et al.
Published: (2024)
Distributed Asynchronous Device Speech Enhancement via Windowed Cross-Attention
by: Yang, Gene-Ping, et al.
Published: (2025)
by: Yang, Gene-Ping, et al.
Published: (2025)
Test-Time Training for Speech Enhancement
by: Behera, Avishkar, et al.
Published: (2025)
by: Behera, Avishkar, et al.
Published: (2025)
Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN
by: Zhang, Shiqi, et al.
Published: (2024)
by: Zhang, Shiqi, et al.
Published: (2024)
UL-UNAS: Ultra-Lightweight U-Nets for Real-Time Speech Enhancement via Network Architecture Search
by: Rong, Xiaobin, et al.
Published: (2025)
by: Rong, Xiaobin, et al.
Published: (2025)
FNSE-SBGAN: Far-field Speech Enhancement with Schrodinger Bridge and Generative Adversarial Networks
by: Lei, Tong, et al.
Published: (2025)
by: Lei, Tong, et al.
Published: (2025)
MeanSE: Efficient Generative Speech Enhancement with Mean Flows
by: Wang, Jiahe, et al.
Published: (2025)
by: Wang, Jiahe, et al.
Published: (2025)
CMGAN: Conformer-based Metric GAN for Speech Enhancement
by: Cao, Ruizhe, et al.
Published: (2022)
by: Cao, Ruizhe, et al.
Published: (2022)
Interspeech 2025 URGENT Speech Enhancement Challenge
by: Saijo, Kohei, et al.
Published: (2025)
by: Saijo, Kohei, et al.
Published: (2025)
ProSE: Diffusion Priors for Speech Enhancement
by: Kumar, Sonal, et al.
Published: (2025)
by: Kumar, Sonal, et al.
Published: (2025)
Test-Time Adaptation for Speech Enhancement via Domain Invariant Embedding Transformation
by: Raichle, Tobias, et al.
Published: (2025)
by: Raichle, Tobias, et al.
Published: (2025)
Enhancement of Dysarthric Speech Reconstruction by Contrastive Learning
by: Fatemeh, Keshvari, et al.
Published: (2024)
by: Fatemeh, Keshvari, et al.
Published: (2024)
On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement
by: López-Espejo, Iván, et al.
Published: (2024)
by: López-Espejo, Iván, et al.
Published: (2024)
A Dual-Branch Parallel Network for Speech Enhancement and Restoration
by: Yang, Da-Hee, et al.
Published: (2024)
by: Yang, Da-Hee, et al.
Published: (2024)
Similar Items
-
Study of Lightweight Transformer Architectures for Single-Channel Speech Enhancement
by: Zhao, Haixin, et al.
Published: (2025) -
Dynamic Slimmable Networks for Efficient Speech Separation
by: Elminshawi, Mohamed, et al.
Published: (2025) -
Enhanced Deep Speech Separation in Clustered Ad Hoc Distributed Microphone Environments
by: Kim, Jihyun, et al.
Published: (2024) -
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement
by: Huang, Ziling, et al.
Published: (2025) -
Hallucination in Perceptual Metric-Driven Speech Enhancement Networks
by: Close, George, et al.
Published: (2024)