| Main Authors: | Pang, Yutian, Kendall, Andrew Paul, Porcayo, Alex, Barsotti, Mariah, Jain, Anahita, Clarke, John-Paul |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.04974 |
Similar Items
Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding
by: Huo, Mingyue, et al.
Published: (2025)
The Risks and Detection of Overestimated Privacy Protection in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2025)
Face-Voice Association for Audiovisual Active Speaker Detection in Egocentric Recordings
by: Clarke, Jason, et al.
Published: (2025)
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models
by: Bereuter, Paul A., et al.
Published: (2025)
VoiceSculptor: Your Voice, Designed By You
by: Hu, Jingbin, et al.
Published: (2026)
DreamVoice: Text-Guided Voice Conversion
by: Hai, Jiarui, et al.
Published: (2024)
Voices of Civilizations: A Multilingual QA Benchmark for Global Music Understanding
by: Wu, Shangda, et al.
Published: (2026)
The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2024)
Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
by: Salman, Ali N., et al.
Published: (2024)
A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms
by: Monir, Nasser-Eddine, et al.
Published: (2024)
Fair-Gate: Fairness-Aware Interpretable Risk Gating for Sex-Fair Voice Biometrics
by: Qu, Yangyang, et al.
Published: (2026)
Use Cases for Voice Anonymization
by: Meyer, Sarina, et al.
Published: (2025)
Kinship Verification Using Voice
by: Mishra, Jagabandhu, et al.
Published: (2026)
Compact Neural TTS Voices for Accessibility
by: Jain, Kunal, et al.
Published: (2025)
Quality Assessment of Noisy and Enhanced Speech with Limited Data: UWB-NTIS System for VoiceMOS 2024
by: Kunešová, Marie, et al.
Published: (2025)
EchoVoices: Preserving Generational Voices and Memories for Seniors and Children
by: Xu, Haiying, et al.
Published: (2025)
Turbocharge Speech Understanding with Pilot Inference
by: Wang, Rongxiang, et al.
Published: (2023)
Noise-Robust Hearing Aid Voice Control
by: López-Espejo, Iván, et al.
Published: (2024)
Human Voice is Unique
by: Singh, Rita, et al.
Published: (2025)
StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
by: Wang, Zhichao, et al.
Published: (2024)
VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
by: Rahimi, Akam, et al.
Published: (2025)
NPU-NTU System for Voice Privacy 2024 Challenge
by: Yao, Jixun, et al.
Published: (2024)
Voice Conversion Augmentation for Speaker Recognition on Defective Datasets
by: Tao, Ruijie, et al.
Published: (2024)
Robust Speech Activity Detection in the Presence of Singing Voice
by: Grundhuber, Philipp, et al.
Published: (2025)
Advancing Airport Tower Command Recognition: Integrating Squeeze-and-Excitation and Broadcasted Residual Learning
by: Lin, Yuanxi, et al.
Published: (2024)
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
by: Seki, Kentaro, et al.
Published: (2024)
LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models
by: Kameoka, Hirokazu, et al.
Published: (2025)
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
by: Kameoka, Hirokazu, et al.
Published: (2020)
Voice-ENHANCE: Speech Restoration using a Diffusion-based Voice Conversion Framework
by: Byun, Kyungguen, et al.
Published: (2025)
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
by: Sekkat, Chloé, et al.
Published: (2024)
Voice Mapping of Text-to-Speech Systems: A Metric-Based Approach for Voice Quality Assessment
by: Cai, Huanchen, et al.
Published: (2026)
SingIt! Singer Voice Transformation
by: Eliav, Amit, et al.
Published: (2024)
Controlling your Attributes in Voice
by: Li, Xuyuan, et al.
Published: (2025)
Objective Measurements of Voice Quality
by: Dhamyal, Hira, et al.
Published: (2024)
Interpreting Pretrained Speech Models for Automatic Speech Assessment of Voice Disorders
by: Lau, Hok-Shing, et al.
Published: (2024)
OneVoice: One Model, Triple Scenarios-Towards Unified Zero-shot Voice Conversion
by: Wang, Zhichao, et al.
Published: (2026)
Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
by: Du, Zongyang, et al.
Published: (2024)
TidyVoice: A Curated Multilingual Dataset for Speaker Verification Derived from Common Voice
by: Farhadipour, Aref, et al.
Published: (2026)
Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion
by: Ruggiero, Giuseppe, et al.
Published: (2024)
First Steps Towards Voice Anonymization for Code-Switching Speech
by: Meyer, Sarina, et al.
Published: (2025)