Saved in:
| Main Authors: | Gaznepoglu, Ünal Ege, Leschanowsky, Anna, Aloradi, Ahmad, Singh, Prachi, Tenbrinck, Daniel, Habets, Emanuël A. P., Peters, Nils |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.09521 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
VoxATtack: A Multimodal Attack on Voice Anonymization Systems
by: Aloradi, Ahmad, et al.
Published: (2025)
by: Aloradi, Ahmad, et al.
Published: (2025)
Why disentanglement-based speaker anonymization systems fail at preserving emotions?
by: Gaznepoglu, Ünal Ege, et al.
Published: (2025)
by: Gaznepoglu, Ünal Ege, et al.
Published: (2025)
The Third VoicePrivacy Challenge: Preserving Emotional Expressiveness and Linguistic Content in Voice Anonymization
by: Tomashenko, Natalia, et al.
Published: (2026)
by: Tomashenko, Natalia, et al.
Published: (2026)
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2024)
by: Panariello, Michele, et al.
Published: (2024)
The First VoicePrivacy Attacker Challenge
by: Tomashenko, Natalia, et al.
Published: (2025)
by: Tomashenko, Natalia, et al.
Published: (2025)
The VoicePrivacy 2024 Challenge Evaluation Plan
by: Tomashenko, Natalia, et al.
Published: (2024)
by: Tomashenko, Natalia, et al.
Published: (2024)
The First VoicePrivacy Attacker Challenge Evaluation Plan
by: Tomashenko, Natalia, et al.
Published: (2024)
by: Tomashenko, Natalia, et al.
Published: (2024)
Benchmarking Neural Speech Codec Intelligibility with SITool
by: Leschanowsky, Anna, et al.
Published: (2025)
by: Leschanowsky, Anna, et al.
Published: (2025)
Examining the Interplay Between Privacy and Fairness for Speech Processing: A Review and Perspective
by: Leschanowsky, Anna, et al.
Published: (2024)
by: Leschanowsky, Anna, et al.
Published: (2024)
Robust Speech Activity Detection in the Presence of Singing Voice
by: Grundhuber, Philipp, et al.
Published: (2025)
by: Grundhuber, Philipp, et al.
Published: (2025)
VoiceSculptor: Your Voice, Designed By You
by: Hu, Jingbin, et al.
Published: (2026)
by: Hu, Jingbin, et al.
Published: (2026)
Sample Rate Offset Compensated Acoustic Echo Cancellation For Multi-Device Scenarios
by: Korse, Srikanth, et al.
Published: (2025)
by: Korse, Srikanth, et al.
Published: (2025)
Seeing What You Say: Expressive Image Generation from Speech
by: Lee, Jiyoung, et al.
Published: (2025)
by: Lee, Jiyoung, et al.
Published: (2025)
Leveraging Discriminative Latent Representations for Conditioning GAN-Based Speech Enhancement
by: Shetu, Shrishti Saha, et al.
Published: (2025)
by: Shetu, Shrishti Saha, et al.
Published: (2025)
Neural Directional Filtering with Configurable Directivity Pattern at Inference
by: Huang, Weilong, et al.
Published: (2025)
by: Huang, Weilong, et al.
Published: (2025)
Navigating PESQ: Up-to-Date Versions and Open Implementations
by: Torcoli, Matteo, et al.
Published: (2025)
by: Torcoli, Matteo, et al.
Published: (2025)
GAN-Based Multi-Microphone Spatial Target Speaker Extraction
by: Shetu, Shrishti Saha, et al.
Published: (2025)
by: Shetu, Shrishti Saha, et al.
Published: (2025)
Acoustic Teleportation via Disentangled Neural Audio Codec Representations
by: Grundhuber, Philipp, et al.
Published: (2025)
by: Grundhuber, Philipp, et al.
Published: (2025)
Dynamic Slimmable Networks for Efficient Speech Separation
by: Elminshawi, Mohamed, et al.
Published: (2025)
by: Elminshawi, Mohamed, et al.
Published: (2025)
Stereo Reproduction in the Presence of Sample Rate Offsets
by: Korse, Srikanth, et al.
Published: (2025)
by: Korse, Srikanth, et al.
Published: (2025)
What You Read Isn't What You Hear: Linguistic Sensitivity in Deepfake Speech Detection
by: Nguyen, Binh, et al.
Published: (2025)
by: Nguyen, Binh, et al.
Published: (2025)
Comparative Analysis Of Discriminative Deep Learning-Based Noise Reduction Methods In Low SNR Scenarios
by: Shetu, Shrishti Saha, et al.
Published: (2024)
by: Shetu, Shrishti Saha, et al.
Published: (2024)
On the Relation Between Speech Quality and Quantized Latent Representations of Neural Codecs
by: Halimeh, Mhd Modar, et al.
Published: (2025)
by: Halimeh, Mhd Modar, et al.
Published: (2025)
Room Impulse Response Completion Using Signal-Prediction Diffusion Models Conditioned on Simulated Early Reflections
by: Xu, Zeyu, et al.
Published: (2026)
by: Xu, Zeyu, et al.
Published: (2026)
Training Strategies for Modality Dropout Resilient Multi-Modal Target Speaker Extraction
by: Korse, Srikanth, et al.
Published: (2025)
by: Korse, Srikanth, et al.
Published: (2025)
ConcateNet: Dialogue Separation Using Local And Global Feature Concatenation
by: Halimeh, Mhd Modar, et al.
Published: (2024)
by: Halimeh, Mhd Modar, et al.
Published: (2024)
Blind Acoustic Parameter Estimation Through Task-Agnostic Embeddings Using Latent Approximations
by: Götz, Philipp, et al.
Published: (2024)
by: Götz, Philipp, et al.
Published: (2024)
NDF+: Joint Neural Directional Filtering and Diffuse Sound Extraction
by: Huang, Weilong, et al.
Published: (2026)
by: Huang, Weilong, et al.
Published: (2026)
Data-driven Joint Detection and Localization of Acoustic Reflectors
by: Bicer, H. Nazim, et al.
Published: (2024)
by: Bicer, H. Nazim, et al.
Published: (2024)
Low-Resource Text-to-Speech Synthesis Using Noise-Augmented Training of ForwardTacotron
by: Lakshminarayana, Kishor Kayyar, et al.
Published: (2025)
by: Lakshminarayana, Kishor Kayyar, et al.
Published: (2025)
Assessing the Impact of Noise and Speech Enhancement on the Intelligibility of Speech Codecs
by: Behringer, Lyonel, et al.
Published: (2026)
by: Behringer, Lyonel, et al.
Published: (2026)
Voice Privacy Preservation with Multiple Random Orthogonal Secret Keys: Attack Resistance Analysis
by: Tanaka, Kohei, et al.
Published: (2025)
by: Tanaka, Kohei, et al.
Published: (2025)
Neural Directional Filtering Using a Compact Microphone Array
by: Huang, Weilong, et al.
Published: (2025)
by: Huang, Weilong, et al.
Published: (2025)
Matching Reverberant Speech Through Learned Acoustic Embeddings and Feedback Delay Networks
by: Götz, Philipp, et al.
Published: (2025)
by: Götz, Philipp, et al.
Published: (2025)
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
by: Shetu, Shrishti Saha, et al.
Published: (2024)
by: Shetu, Shrishti Saha, et al.
Published: (2024)
Audio-Visual Speech Enhancement for Spatial Audio - Spatial-VisualVoice and the MAVE Database
by: Yaffe, Danielle, et al.
Published: (2025)
by: Yaffe, Danielle, et al.
Published: (2025)
Expanding and Analyzing ODAQ -- the Open Dataset of Audio Quality
by: Dick, Sascha, et al.
Published: (2025)
by: Dick, Sascha, et al.
Published: (2025)
Neural Directional Filtering: Far-Field Directivity Control With a Small Microphone Array
by: Wechsler, Julian, et al.
Published: (2024)
by: Wechsler, Julian, et al.
Published: (2024)
I Know You're Listening: Adaptive Voice for HRI
by: Tuttösí, Paige
Published: (2025)
by: Tuttösí, Paige
Published: (2025)
A Hybrid Approach for Low-Complexity Joint Acoustic Echo and Noise Reduction
by: Shetu, Shrishti Saha, et al.
Published: (2024)
by: Shetu, Shrishti Saha, et al.
Published: (2024)
Similar Items
-
VoxATtack: A Multimodal Attack on Voice Anonymization Systems
by: Aloradi, Ahmad, et al.
Published: (2025) -
Why disentanglement-based speaker anonymization systems fail at preserving emotions?
by: Gaznepoglu, Ünal Ege, et al.
Published: (2025) -
The Third VoicePrivacy Challenge: Preserving Emotional Expressiveness and Linguistic Content in Voice Anonymization
by: Tomashenko, Natalia, et al.
Published: (2026) -
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2024) -
The First VoicePrivacy Attacker Challenge
by: Tomashenko, Natalia, et al.
Published: (2025)