:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Lemercier, Jean-Marie, Richter, Julius, Welker, Simon, Moliner, Eloi, Välimäki, Vesa, Gerkmann, Timo
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Audio and Speech Processing Machine Learning Sound
Accesso online:	https://arxiv.org/abs/2402.09821
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models
di: Moliner, Eloi, et al.
Pubblicazione: (2024)

Unsupervised Blind Joint Dereverberation and Room Acoustics Estimation with Diffusion Models
di: Lemercier, Jean-Marie, et al.
Pubblicazione: (2024)

HRTF Estimation using a Score-based Prior
di: Thuillier, Etienne, et al.
Pubblicazione: (2024)

StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation
di: Lemercier, Jean-Marie, et al.
Pubblicazione: (2022)

Speech Enhancement and Dereverberation with Diffusion-based Generative Models
di: Richter, Julius, et al.
Pubblicazione: (2022)

Non-intrusive Speech Quality Assessment with Diffusion Models Trained on Clean Speech
di: de Oliveira, Danilo, et al.
Pubblicazione: (2024)

Blind Audio Bandwidth Extension: A Diffusion-Based Zero-Shot Approach
di: Moliner, Eloi, et al.
Pubblicazione: (2023)

Diffusion-Based Audio Inpainting
di: Moliner, Eloi, et al.
Pubblicazione: (2023)

Similarity-Guided Diffusion for Long-Gap Music Inpainting
di: Turland, Sean, et al.
Pubblicazione: (2025)

Single and Few-step Diffusion for Generative Speech Enhancement
di: Lay, Bunlong, et al.
Pubblicazione: (2023)

A Diffusion-Based Generative Equalizer for Music Restoration
di: Moliner, Eloi, et al.
Pubblicazione: (2024)

Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model
di: Lemercier, Jean-Marie, et al.
Pubblicazione: (2023)

The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement
di: de Oliveira, Danilo, et al.
Pubblicazione: (2024)

Real-Time Streaming Mel Vocoding with Generative Flow Matching
di: Welker, Simon, et al.
Pubblicazione: (2025)

Diffusion Buffer for Online Generative Speech Enhancement
di: Lay, Bunlong, et al.
Pubblicazione: (2025)

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
di: Richter, Julius, et al.
Pubblicazione: (2024)

Gaussian Flow Bridges for Audio Domain Transfer with Unpaired Data
di: Moliner, Eloi, et al.
Pubblicazione: (2024)

An Analysis of the Variance of Diffusion-based Speech Enhancement
di: Lay, Bunlong, et al.
Pubblicazione: (2024)

Resampling Filter Design for Multirate Neural Audio Effect Processing
di: Carson, Alistair, et al.
Pubblicazione: (2025)

Bone-conduction Guided Multimodal Speech Enhancement with Conditional Diffusion Models
di: Khanagha, Sina, et al.
Pubblicazione: (2026)

LipDiffuser: Lip-to-Speech Generation with Conditional Diffusion Models
di: Richter, Julius, et al.
Pubblicazione: (2025)

Do We Need EMA for Diffusion-Based Speech Enhancement? Toward a Magnitude-Preserving Network Architecture
di: Richter, Julius, et al.
Pubblicazione: (2025)

ReverbFX: A Dataset of Room Impulse Responses Derived from Reverb Effect Plugins for Singing Voice Dereverberation
di: Richter, Julius, et al.
Pubblicazione: (2025)

Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters
di: Tesch, Kristina, et al.
Pubblicazione: (2023)

Steering Deep Non-Linear Spatially Selective Filters for Weakly Guided Extraction of Moving Speakers in Dynamic Scenarios
di: Kienegger, Jakob, et al.
Pubblicazione: (2025)

Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers
di: Kienegger, Jakob, et al.
Pubblicazione: (2026)

Adaptive Rotary Steering with Joint Autoregression for Robust Extraction of Closely Moving Speakers in Dynamic Scenarios
di: Kienegger, Jakob, et al.
Pubblicazione: (2026)

Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion
di: Švento, Michal, et al.
Pubblicazione: (2025)

Investigating Training Objectives for Generative Speech Enhancement
di: Richter, Julius, et al.
Pubblicazione: (2024)

FlowDec: A flow-based full-band general audio codec with high perceptual quality
di: Welker, Simon, et al.
Pubblicazione: (2025)

Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation
di: Kienegger, Jakob, et al.
Pubblicazione: (2024)

Are Modern Speech Enhancement Systems Vulnerable to Adversarial Attacks?
di: Makarov, Rostislav, et al.
Pubblicazione: (2025)

An Octave-based Multi-Resolution CQT Architecture for Diffusion-based Audio Generation
di: da Costa, Maurício do V. M., et al.
Pubblicazione: (2025)

Automatic Music Mixing using a Generative Model of Effect Embeddings
di: Moliner, Eloi, et al.
Pubblicazione: (2025)

EMOCONV-DIFF: Diffusion-based Speech Emotion Conversion for Non-parallel and In-the-wild Data
di: Prabhu, Navin Raj, et al.
Pubblicazione: (2023)

Self-Steering Deep Non-Linear Spatially Selective Filters for Efficient Extraction of Moving Speakers under Weak Guidance
di: Kienegger, Jakob, et al.
Pubblicazione: (2025)

An Independence-promoting Loss for Music Generation with Language Models
di: Lemercier, Jean-Marie, et al.
Pubblicazione: (2024)

A Method for Capturing and Reproducing Directional Reverberation in Six Degrees of Freedom
di: Alary, Benoit, et al.
Pubblicazione: (2021)

Sample Rate Independent Recurrent Neural Networks for Audio Effects Processing
di: Carson, Alistair, et al.
Pubblicazione: (2024)

Unsupervised Estimation of Nonlinear Audio Effects: Comparing Diffusion-Based and Adversarial approaches
di: Moliner, Eloi, et al.
Pubblicazione: (2025)