Saved in:
| Main Authors: | Wu, Yihsuan, Chiu, Yukai, Anthony, Michael, Bai, Mingsian R. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.06310 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Visual-Informed Speech Enhancement Using Attention-Based Beamforming
by: Liu, Chihyun, et al.
Published: (2026)
by: Liu, Chihyun, et al.
Published: (2026)
A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes
by: Hsu, Yicheng, et al.
Published: (2024)
by: Hsu, Yicheng, et al.
Published: (2024)
Spatial-Temporal Activity-Informed Diarization and Separation
by: Hsu, Yicheng, et al.
Published: (2024)
by: Hsu, Yicheng, et al.
Published: (2024)
AmbiDrop: Array-Agnostic Speech Enhancement Using Ambisonics Encoding and Dropout-Based Learning
by: Tatarjitzky, Michael, et al.
Published: (2025)
by: Tatarjitzky, Michael, et al.
Published: (2025)
Attention-Based Beamformer For Multi-Channel Speech Enhancement
by: Bai, Jinglin, et al.
Published: (2024)
by: Bai, Jinglin, et al.
Published: (2024)
Interpreting End-to-End Deep Learning Models for Speech Source Localization Using Layer-wise Relevance Propagation
by: Comanducci, Luca, et al.
Published: (2024)
by: Comanducci, Luca, et al.
Published: (2024)
Speech Quality-Based Localization of Low-Quality Speech and Text-to-Speech Synthesis Artefacts
by: Kuhlmann, Michael, et al.
Published: (2026)
by: Kuhlmann, Michael, et al.
Published: (2026)
SELM: Speech Enhancement Using Discrete Tokens and Language Models
by: Wang, Ziqian, et al.
Published: (2023)
by: Wang, Ziqian, et al.
Published: (2023)
Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement
by: Sutherland, Robert, et al.
Published: (2024)
by: Sutherland, Robert, et al.
Published: (2024)
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
by: Saijo, Kohei, et al.
Published: (2024)
by: Saijo, Kohei, et al.
Published: (2024)
Enhancement of Dysarthric Speech Reconstruction by Contrastive Learning
by: Fatemeh, Keshvari, et al.
Published: (2024)
by: Fatemeh, Keshvari, et al.
Published: (2024)
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
by: Chiang, Hsin-Tien, et al.
Published: (2024)
by: Chiang, Hsin-Tien, et al.
Published: (2024)
An Exploration of Length Generalization in Transformer-Based Speech Enhancement
by: Zhang, Qiquan, et al.
Published: (2024)
by: Zhang, Qiquan, et al.
Published: (2024)
CosyAccent: Duration-Controllable Accent Normalization Using Source-Synthesis Training Data
by: Bai, Qibing, et al.
Published: (2026)
by: Bai, Qibing, et al.
Published: (2026)
A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement
by: Zhang, Jie, et al.
Published: (2025)
by: Zhang, Jie, et al.
Published: (2025)
EffortNet: A Deep Learning Framework for Objective Assessment of Speech Enhancement Technologies Using EEG-Based Alpha Oscillations
by: Sung, Ching-Chih, et al.
Published: (2025)
by: Sung, Ching-Chih, et al.
Published: (2025)
A Hybrid Discriminative and Generative System for Universal Speech Enhancement
by: Liu, Yinghao, et al.
Published: (2026)
by: Liu, Yinghao, et al.
Published: (2026)
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics
by: Wang, Syu-Siang, et al.
Published: (2024)
by: Wang, Syu-Siang, et al.
Published: (2024)
Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement
by: Nasretdinov, Rauf, et al.
Published: (2025)
by: Nasretdinov, Rauf, et al.
Published: (2025)
TripleC Learning and Lightweight Speech Enhancement for Multi-Condition Target Speech Extraction
by: Huang, Ziling
Published: (2025)
by: Huang, Ziling
Published: (2025)
Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks
by: Tokala, Vikas, et al.
Published: (2024)
by: Tokala, Vikas, et al.
Published: (2024)
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition
by: Guo, Zilu, et al.
Published: (2024)
by: Guo, Zilu, et al.
Published: (2024)
Rethinking Flow and Diffusion Bridge Models for Speech Enhancement
by: Wang, Dahan, et al.
Published: (2026)
by: Wang, Dahan, et al.
Published: (2026)
Selective State Space Model for Monaural Speech Enhancement
by: Chen, Moran, et al.
Published: (2024)
by: Chen, Moran, et al.
Published: (2024)
Influence of Clean Speech Characteristics on Speech Enhancement Performance
by: Hou, Mingchi, et al.
Published: (2025)
by: Hou, Mingchi, et al.
Published: (2025)
Jointly Recognizing Speech and Singing Voices Based on Multi-Task Audio Source Separation
by: Bai, Ye, et al.
Published: (2024)
by: Bai, Ye, et al.
Published: (2024)
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions
by: Wang, Zheng, et al.
Published: (2025)
by: Wang, Zheng, et al.
Published: (2025)
Learning Time-Graph Frequency Representation for Monaural Speech Enhancement
by: Wang, Tingting, et al.
Published: (2025)
by: Wang, Tingting, et al.
Published: (2025)
HyBeam: Hybrid Microphone-Beamforming Array-Agnostic Speech Enhancement for Wearables
by: Ilan, Yuval Bar, et al.
Published: (2025)
by: Ilan, Yuval Bar, et al.
Published: (2025)
Objective and Subjective Evaluation of Diffusion-Based Speech Enhancement for Dysarthric Speech
by: de Groot, Dimme, et al.
Published: (2025)
by: de Groot, Dimme, et al.
Published: (2025)
Direction-Preserving MIMO Speech Enhancement Using a Neural Covariance Estimator
by: Deppisch, Thomas
Published: (2026)
by: Deppisch, Thomas
Published: (2026)
Binaural Speech Enhancement Using Complex Convolutional Recurrent Networks
by: Tokala, Vikas, et al.
Published: (2025)
by: Tokala, Vikas, et al.
Published: (2025)
Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement
by: Shetu, Shrishti Saha, et al.
Published: (2025)
by: Shetu, Shrishti Saha, et al.
Published: (2025)
Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement
by: Zheng, Tianqin, et al.
Published: (2025)
by: Zheng, Tianqin, et al.
Published: (2025)
A Semi-spontaneous Dutch Speech Dataset for Speech Enhancement and Speech Recognition
by: de Groot, Dimme, et al.
Published: (2026)
by: de Groot, Dimme, et al.
Published: (2026)
Stack Less, Repeat More: A Block Reusing Approach for Progressive Speech Enhancement
by: Kim, Jangyeon, et al.
Published: (2025)
by: Kim, Jangyeon, et al.
Published: (2025)
P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge
by: Sach, Marvin, et al.
Published: (2025)
by: Sach, Marvin, et al.
Published: (2025)
Generalizability of Predictive and Generative Speech Enhancement Models to Pathological Speakers
by: Hou, Mingchi, et al.
Published: (2025)
by: Hou, Mingchi, et al.
Published: (2025)
LORT: Locally Refined Convolution and Taylor Transformer for Monaural Speech Enhancement
by: Wang, Junyu, et al.
Published: (2025)
by: Wang, Junyu, et al.
Published: (2025)
Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)
by: Behringer, Lyonel, et al.
Published: (2026)
Similar Items
-
Visual-Informed Speech Enhancement Using Attention-Based Beamforming
by: Liu, Chihyun, et al.
Published: (2026) -
A tunable binaural audio telepresence system capable of balancing immersive and enhanced modes
by: Hsu, Yicheng, et al.
Published: (2024) -
Spatial-Temporal Activity-Informed Diarization and Separation
by: Hsu, Yicheng, et al.
Published: (2024) -
AmbiDrop: Array-Agnostic Speech Enhancement Using Ambisonics Encoding and Dropout-Based Learning
by: Tatarjitzky, Michael, et al.
Published: (2025) -
Attention-Based Beamformer For Multi-Channel Speech Enhancement
by: Bai, Jinglin, et al.
Published: (2024)