| Main Authors: | Lee, Ching-Hua, Yang, Chouchang, Cho, Jaejin, Saidutta, Yashas Malur, Srinivasa, Rakshith Sharma, Shen, Yilin, Jin, Hongxia |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.13574 |
Similar Items
Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion
by: Švento, Michal, et al.
Published: (2025)
A Diffusion-Based Generative Equalizer for Music Restoration
by: Moliner, Eloi, et al.
Published: (2024)
A Fast Solver for Interpolating Stochastic Differential Equation Diffusion Models for Speech Restoration
by: Lay, Bunlong, et al.
Published: (2026)
VoiceRestore: Flow-Matching Transformers for Speech Recording Quality Restoration
by: Kirdey, Stanislav
Published: (2025)
Diffusion Models for Audio Restoration
by: Lemercier, Jean-Marie, et al.
Published: (2024)
Towards Real-Time Generative Speech Restoration with Flow-Matching
by: Hsieh, Tsun-An, et al.
Published: (2025)
Voice-ENHANCE: Speech Restoration using a Diffusion-based Voice Conversion Framework
by: Byun, Kyungguen, et al.
Published: (2025)
Real-time Speech Restoration using Data Prediction Mean Flows
by: Braun, Sebastian
Published: (2026)
FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder
by: Nguyen, Tan Dat, et al.
Published: (2024)
Query-Based Asymmetric Modeling with Decoupled Input-Output Rates for Speech Restoration
by: Shin, Ui-Hyeop, et al.
Published: (2025)
On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation
by: Meise, Adrian, et al.
Published: (2025)
LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models
by: Kameoka, Hirokazu, et al.
Published: (2025)
The CCF AATC 2025 Speech Restoration Challenge: A Retrospective
by: Zhang, Junan, et al.
Published: (2025)
Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility
by: Liu, Xiaoyu, et al.
Published: (2024)
CFMDCTCodec: A Low-Bitrate Neural Speech Codec with Noise-Prior-aware Conditional Flow Matching for MDCT-Spectral Enhancement
by: Jiang, Xiao-Hang, et al.
Published: (2026)
Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
by: Cheddad, Zohra Adila, et al.
Published: (2021)
Room Impulse Response Completion Using Signal-Prediction Diffusion Models Conditioned on Simulated Early Reflections
by: Xu, Zeyu, et al.
Published: (2026)
ProSE: Diffusion Priors for Speech Enhancement
by: Kumar, Sonal, et al.
Published: (2025)
Language model integration based on memory control for sequence to sequence speech recognition
by: Cho, Jaejin, et al.
Published: (2018)
VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion
by: Byun, Kyungguen, et al.
Published: (2024)
FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
by: Yang, Da-Hee, et al.
Published: (2025)
ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability
by: Nakata, Wataru, et al.
Published: (2025)
Evaluation of an ITD-to-ILD Transformation as a Method to Restore the Spatial Benefit in Speech Intelligibility in Hearing Impaired Listeners
by: Bäumer, Timm-Jonas, et al.
Published: (2025)
Ring Mixing with Auxiliary Signal-to-Consistency-Error Ratio Loss for Unsupervised Denoising in Speech Separation
by: Maciejewski, Matthew, et al.
Published: (2026)
MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
by: Lu, Ye-Xin, et al.
Published: (2023)
Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
by: Chiang, Hsin-Tien, et al.
Published: (2024)
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration
by: Ku, Pin-Jui, et al.
Published: (2024)
Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation
by: Chung, Soo-Whan, et al.
Published: (2025)
Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
by: Karita, Shigeki, et al.
Published: (2025)
Audio Palette: A Diffusion Transformer with Multi-Signal Conditioning for Controllable Foley Synthesis
by: Wang, Junnuo
Published: (2025)
GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model
by: Liu, Haocheng, et al.
Published: (2024)
GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis
by: Baoueb, Teysir, et al.
Published: (2025)
Towards HRTF Personalization using Denoising Diffusion Models
by: Sánchez, Juan Camilo Albarracín, et al.
Published: (2025)
Complex Image-Generative Diffusion Transformer for Audio Denoising
by: Li, Junhui, et al.
Published: (2024)
A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions
by: Du, Hui-Peng, et al.
Published: (2024)
RaD-Net: A Repairing and Denoising Network for Speech Signal Improvement
by: Liu, Mingshuai, et al.
Published: (2024)
A Dual-Branch Parallel Network for Speech Enhancement and Restoration
by: Yang, Da-Hee, et al.
Published: (2024)
DTT-BSR: GAN-based DTTNet with RoPE Transformer Enhancement for Music Source Restoration
by: Tan, Shihong, et al.
Published: (2026)
SEMamba++: A General Speech Restoration Framework Leveraging Global, Local, and Periodic Spectral Patterns
by: Lee, Yongjoon, et al.
Published: (2026)
Sidon: Fast and Robust Open-Source Multilingual Speech Restoration for Large-scale Dataset Cleansing
by: Nakata, Wataru, et al.
Published: (2025)