:: Library Catalog

Bibliographic Details
Main Authors:	Gaznepoglu, Ünal Ege; Leschanowsky, Anna; Aloradi, Ahmad; Singh, Prachi; Tenbrinck, Daniel; Habets, Emanuël A. P.; Peters, Nils
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing; Computation and Language
Online Access:	https://arxiv.org/abs/2506.09521

Similar Items

VoxATtack: A Multimodal Attack on Voice Anonymization Systems
by: Aloradi, Ahmad, et al.
Published: (2025)

Why disentanglement-based speaker anonymization systems fail at preserving emotions?
by: Gaznepoglu, Ünal Ege, et al.
Published: (2025)

The Third VoicePrivacy Challenge: Preserving Emotional Expressiveness and Linguistic Content in Voice Anonymization
by: Tomashenko, Natalia, et al.
Published: (2026)

The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2024)

The First VoicePrivacy Attacker Challenge
by: Tomashenko, Natalia, et al.
Published: (2025)

The VoicePrivacy 2024 Challenge Evaluation Plan
by: Tomashenko, Natalia, et al.
Published: (2024)

The First VoicePrivacy Attacker Challenge Evaluation Plan
by: Tomashenko, Natalia, et al.
Published: (2024)

Benchmarking Neural Speech Codec Intelligibility with SITool
by: Leschanowsky, Anna, et al.
Published: (2025)

Examining the Interplay Between Privacy and Fairness for Speech Processing: A Review and Perspective
by: Leschanowsky, Anna, et al.
Published: (2024)

Robust Speech Activity Detection in the Presence of Singing Voice
by: Grundhuber, Philipp, et al.
Published: (2025)

VoiceSculptor: Your Voice, Designed By You
by: Hu, Jingbin, et al.
Published: (2026)

Sample Rate Offset Compensated Acoustic Echo Cancellation For Multi-Device Scenarios
by: Korse, Srikanth, et al.
Published: (2025)

Seeing What You Say: Expressive Image Generation from Speech
by: Lee, Jiyoung, et al.
Published: (2025)

Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement
by: Shetu, Shrishti Saha, et al.
Published: (2025)

Neural Directional Filtering with Configurable Directivity Pattern at Inference
by: Huang, Weilong, et al.
Published: (2025)

Navigating PESQ: Up-to-Date Versions and Open Implementations
by: Torcoli, Matteo, et al.
Published: (2025)

GAN-Based Multi-Microphone Spatial Target Speaker Extraction
by: Shetu, Shrishti Saha, et al.
Published: (2025)

Acoustic Teleportation via Disentangled Neural Audio Codec Representations
by: Grundhuber, Philipp, et al.
Published: (2025)

Dynamic Slimmable Networks for Efficient Speech Separation
by: Elminshawi, Mohamed, et al.
Published: (2025)

Stereo Reproduction in the Presence of Sample Rate Offsets
by: Korse, Srikanth, et al.
Published: (2025)

What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection
by: Nguyen, Binh, et al.
Published: (2025)

Comparative Analysis Of Discriminative Deep Learning-Based Noise Reduction Methods In Low SNR Scenarios
by: Shetu, Shrishti Saha, et al.
Published: (2024)

On the Relation Between Speech Quality and Quantized Latent Representations of Neural Codecs
by: Halimeh, Mhd Modar, et al.
Published: (2025)

Room Impulse Response Completion Using Signal-Prediction Diffusion Models Conditioned on Simulated Early Reflections
by: Xu, Zeyu, et al.
Published: (2026)

Training Strategies for Modality Dropout Resilient Multi-Modal Target Speaker Extraction
by: Korse, Srikanth, et al.
Published: (2025)

ConcateNet: Dialogue Separation Using Local And Global Feature Concatenation
by: Halimeh, Mhd Modar, et al.
Published: (2024)

Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations
by: Götz, Philipp, et al.
Published: (2024)

NDF+: Joint Neural Directional Filtering and Diffuse Sound Extraction
by: Huang, Weilong, et al.
Published: (2026)

Data-driven Joint Detection and Localization of Acoustic Reflectors
by: Bicer, H. Nazim, et al.
Published: (2024)

Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron
by: Lakshminarayana, Kishor Kayyar, et al.
Published: (2025)

Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)

Voice Privacy Preservation with Multiple Random Orthogonal Secret Keys: Attack Resistance Analysis
by: Tanaka, Kohei, et al.
Published: (2025)

Neural Directional Filtering Using a Compact Microphone Array
by: Huang, Weilong, et al.
Published: (2025)

Matching Reverberant Speech Through Learned Acoustic Embeddings and Feedback Delay Networks
by: Götz, Philipp, et al.
Published: (2025)

GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
by: Shetu, Shrishti Saha, et al.
Published: (2024)

Audio-Visual Speech Enhancement for Spatial Audio - Spatial-VisualVoice and the MAVE Database
by: Yaffe, Danielle, et al.
Published: (2025)

Expanding and Analyzing ODAQ -- the Open Dataset of Audio Quality
by: Dick, Sascha, et al.
Published: (2025)

Neural Directional Filtering: Far-Field Directivity Control With a Small Microphone Array
by: Wechsler, Julian, et al.
Published: (2024)

I Know You're Listening: Adaptive Voice for HRI
by: Tuttösí, Paige
Published: (2025)

A Hybrid Approach for Low-Complexity Joint Acoustic Echo and Noise Reduction
by: Shetu, Shrishti Saha, et al.
Published: (2024)