:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lee, Ching-Hua, Yang, Chouchang, Cho, Jaejin, Saidutta, Yashas Malur, Srinivasa, Rakshith Sharma, Shen, Yilin, Jin, Hongxia
Format:	Preprint
Published:	2025
Subjects:	Image and Video Processing Machine Learning Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2502.13574
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion
by: Švento, Michal, et al.
Published: (2025)

A Diffusion-Based Generative Equalizer for Music Restoration
by: Moliner, Eloi, et al.
Published: (2024)

A Fast Solver for Interpolating Stochastic Differential Equation Diffusion Models for Speech Restoration
by: Lay, Bunlong, et al.
Published: (2026)

VoiceRestore: Flow-Matching Transformers for Speech Recording Quality Restoration
by: Kirdey, Stanislav
Published: (2025)

Diffusion Models for Audio Restoration
by: Lemercier, Jean-Marie, et al.
Published: (2024)

Towards Real-Time Generative Speech Restoration with Flow-Matching
by: Hsieh, Tsun-An, et al.
Published: (2025)

Voice-ENHANCE: Speech Restoration using a Diffusion-based Voice Conversion Framework
by: Byun, Kyungguen, et al.
Published: (2025)

Real-time Speech Restoration using Data Prediction Mean Flows
by: Braun, Sebastian
Published: (2026)

FreGrad: Lightweight and Fast Frequency-aware Diffusion Vocoder
by: Nguyen, Tan Dat, et al.
Published: (2024)

Query-Based Asymmetric Modeling with Decoupled Input-Output Rates for Speech Restoration
by: Shin, Ui-Hyeop, et al.
Published: (2025)

On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation
by: Meise, Adrian, et al.
Published: (2025)

LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models
by: Kameoka, Hirokazu, et al.
Published: (2025)

The CCF AATC 2025 Speech Restoration Challenge: A Retrospective
by: Zhang, Junan, et al.
Published: (2025)

Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility
by: Liu, Xiaoyu, et al.
Published: (2024)

CFMDCTCodec: A Low-Bitrate Neural Speech Codec with Noise-Prior-aware Conditional Flow Matching for MDCT-Spectral Enhancement
by: Jiang, Xiao-Hang, et al.
Published: (2026)

Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
by: Cheddad, Zohra Adila, et al.
Published: (2021)

Room Impulse Response Completion Using Signal-Prediction Diffusion Models Conditioned on Simulated Early Reflections
by: Xu, Zeyu, et al.
Published: (2026)

ProSE: Diffusion Priors for Speech Enhancement
by: Kumar, Sonal, et al.
Published: (2025)

Language model integration based on memory control for sequence to sequence speech recognition
by: Cho, Jaejin, et al.
Published: (2018)

VC-ENHANCE: Speech Restoration with Integrated Noise Suppression and Voice Conversion
by: Byun, Kyungguen, et al.
Published: (2024)

FLOWER: Flow-Based Estimated Gaussian Guidance for General Speech Restoration
by: Yang, Da-Hee, et al.
Published: (2025)

ReverbMiipher: Generative Speech Restoration meets Reverberation Characteristics Controllability
by: Nakata, Wataru, et al.
Published: (2025)

Evaluation of an ITD-to-ILD Transformation as a Method to Restore the Spatial Benefit in Speech Intelligibility in Hearing Impaired Listeners
by: Bäumer, Timm-Jonas, et al.
Published: (2025)

Ring Mixing with Auxiliary Signal-to-Consistency-Error Ratio Loss for Unsupervised Denoising in Speech Separation
by: Maciejewski, Matthew, et al.
Published: (2026)

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra
by: Lu, Ye-Xin, et al.
Published: (2023)

Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules
by: Chiang, Hsin-Tien, et al.
Published: (2024)

Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration
by: Ku, Pin-Jui, et al.
Published: (2024)

Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation
by: Chung, Soo-Whan, et al.
Published: (2025)

Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration
by: Karita, Shigeki, et al.
Published: (2025)

Audio Palette: A Diffusion Transformer with Multi-Signal Conditioning for Controllable Foley Synthesis
by: Wang, Junnuo
Published: (2025)

GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model
by: Liu, Haocheng, et al.
Published: (2024)

GLA-Grad++: An Improved Griffin-Lim Guided Diffusion Model for Speech Synthesis
by: Baoueb, Teysir, et al.
Published: (2025)

Towards HRTF Personalization using Denoising Diffusion Models
by: Sánchez, Juan Camilo Albarracín, et al.
Published: (2025)

Complex Image-Generative Diffusion Transformer for Audio Denoising
by: Li, Junhui, et al.
Published: (2024)

A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions
by: Du, Hui-Peng, et al.
Published: (2024)

RaD-Net: A Repairing and Denoising Network for Speech Signal Improvement
by: Liu, Mingshuai, et al.
Published: (2024)

A Dual-Branch Parallel Network for Speech Enhancement and Restoration
by: Yang, Da-Hee, et al.
Published: (2024)

DTT-BSR: GAN-based DTTNet with RoPE Transformer Enhancement for Music Source Restoration
by: Tan, Shihong, et al.
Published: (2026)

SEMamba++: A General Speech Restoration Framework Leveraging Global, Local, and Periodic Spectral Patterns
by: Lee, Yongjoon, et al.
Published: (2026)

Sidon: Fast and Robust Open-Source Multilingual Speech Restoration for Large-scale Dataset Cleansing
by: Nakata, Wataru, et al.
Published: (2025)