:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhou, Naisong, Phaye, Saisamarth Rajesh, Cernak, Milos, Stojkovic, Tijana, Pearce, Andy, Cavallaro, Andrea, Harper, Andy
Format:	Preprint
Published:	2025
Subjects:	Sound Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.21522
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Model as Loss: A Self-Consistent Training Paradigm
by: Phaye, Saisamarth Rajesh, et al.
Published: (2025)

Multi-Channel MOSRA: Mean Opinion Score and Room Acoustics Estimation Using Simulated Data and a Teacher Model
by: Coldenhoff, Jozef, et al.
Published: (2023)

DeepFilterGAN: A Full-band Real-time Speech Enhancement System with GAN-based Stochastic Regeneration
by: Serbest, Sanberk, et al.
Published: (2025)

Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule
by: Wang, Siyi, et al.
Published: (2024)

Semi-intrusive audio evaluation: Casting non-intrusive assessment as a multi-modal text prediction task
by: Coldenhoff, Jozef, et al.
Published: (2024)

Differentiable Time-Varying IIR Filtering for Real-Time Speech Denoising
by: Rota, Riccardo, et al.
Published: (2026)

OpenACE: An Open Benchmark for Evaluating Audio Coding Performance
by: Coldenhoff, Jozef, et al.
Published: (2024)

Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement
by: Yang, Gang, et al.
Published: (2025)

Flowing Straighter with Conditional Flow Matching for Accurate Speech Enhancement
by: Cross, Mattias, et al.
Published: (2025)

FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching
by: Jung, Chaeyoung, et al.
Published: (2024)

MeanFlowSE: One-Step Generative Speech Enhancement via MeanFlow
by: Zhu, Yike, et al.
Published: (2025)

Generative Pre-training for Speech with Flow Matching
by: Liu, Alexander H., et al.
Published: (2023)

Latent-Level Enhancement with Flow Matching for Robust Automatic Speech Recognition
by: Yang, Da-Hee, et al.
Published: (2026)

DSFlow: Dual Supervision and Step-Aware Architecture for One-Step Flow Matching Speech Synthesis
by: Lin, Bin, et al.
Published: (2026)

Accelerating Flow-Matching-Based Text-to-Speech via Empirically Pruned Step Sampling
by: Zheng, Qixi, et al.
Published: (2025)

Improving Design of Input Condition Invariant Speech Enhancement
by: Zhang, Wangyou, et al.
Published: (2024)

Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching
by: Zuo, Jialong, et al.
Published: (2025)

DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers
by: Cao, Tianyu, et al.
Published: (2026)

Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation
by: Wang, Wupeng, et al.
Published: (2025)

Parallel Synthesis for Autoregressive Speech Generation
by: Hsu, Po-chun, et al.
Published: (2022)

Schrödinger Bridge Mamba for One-Step Speech Enhancement
by: Yang, Jing, et al.
Published: (2025)

Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders
by: Sun, Xingwei, et al.
Published: (2025)

Drax: Speech Recognition with Discrete Flow Matching
by: Navon, Aviv, et al.
Published: (2025)

Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition
by: Wang, Kuan-Chen, et al.
Published: (2024)

EDNet: A Versatile Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training
by: Kwak, Doyeop, et al.
Published: (2025)

FlowW2N: Whispered-to-Normal Speech Conversion via Flow-Matching
by: Ritter-Gutierrez, Fabian, et al.
Published: (2026)

Universal Discrete-Domain Speech Enhancement
by: Liu, Fei, et al.
Published: (2025)

CogSR: Semantic-Aware Speech Super-Resolution via Chain-of-Thought Guided Flow Matching
by: Yuan, Jiajun, et al.
Published: (2025)

Pre-training Feature Guided Diffusion Model for Speech Enhancement
by: Yang, Yiyuan, et al.
Published: (2024)

DialoSpeech: Dual-Speaker Dialogue Generation with LLM and Flow Matching
by: Xie, Hanke, et al.
Published: (2025)

VoiceRestore: Flow-Matching Transformers for Speech Recording Quality Restoration
by: Kirdey, Stanislav
Published: (2025)

Real-Time Streamable Generative Speech Restoration with Flow Matching
by: Welker, Simon, et al.
Published: (2025)

Multi-Channel Speech Enhancement for Cocktail Party Speech Emotion Recognition
by: Chen, Youjun, et al.
Published: (2026)

Diffusion-based Frameworks for Unsupervised Speech Enhancement
by: Ayilo, Jean-Eudes, et al.
Published: (2026)

RobustSpeechFlow: Learning Robust Text-to-Speech Trajectories via Augmentation-based Contrastive Flow Matching
by: Yang, Jinhyeok, et al.
Published: (2026)

Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis
by: Yang, Dong, et al.
Published: (2025)

Improved Intelligibility of Dysarthric Speech using Conditional Flow Matching
by: Das, Shoutrik, et al.
Published: (2025)

LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhancement
by: Yan, Haoyin, et al.
Published: (2025)

StreamFlow: Streaming Flow Matching with Block-wise Guided Attention Mask for Speech Token Decoding
by: Guo, Dake, et al.
Published: (2025)

VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching
by: Guo, Yiwei, et al.
Published: (2023)