:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pang, Yutian, Kendall, Andrew Paul, Porcayo, Alex, Barsotti, Mariah, Jain, Anahita, Clarke, John-Paul
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing Sound
Online Access:	https://arxiv.org/abs/2503.04974
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding
by: Huo, Mingyue, et al.
Published: (2025)

The Risks and Detection of Overestimated Privacy Protection in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2025)

Face-Voice Association for Audiovisual Active Speaker Detection in Egocentric Recordings
by: Clarke, Jason, et al.
Published: (2025)

Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models
by: Bereuter, Paul A., et al.
Published: (2025)

VoiceSculptor: Your Voice, Designed By You
by: Hu, Jingbin, et al.
Published: (2026)

DreamVoice: Text-Guided Voice Conversion
by: Hai, Jiarui, et al.
Published: (2024)

Voices of Civilizations: A Multilingual QA Benchmark for Global Music Understanding
by: Wu, Shangda, et al.
Published: (2026)

The VoicePrivacy 2022 Challenge: Progress and Perspectives in Voice Anonymisation
by: Panariello, Michele, et al.
Published: (2024)

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline
by: Salman, Ali N., et al.
Published: (2024)

A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms
by: Monir, Nasser-Eddine, et al.
Published: (2024)

Fair-Gate: Fairness-Aware Interpretable Risk Gating for Sex-Fair Voice Biometrics
by: Qu, Yangyang, et al.
Published: (2026)

Use Cases for Voice Anonymization
by: Meyer, Sarina, et al.
Published: (2025)

Kinship Verification Using Voice
by: Mishra, Jagabandhu, et al.
Published: (2026)

Compact Neural TTS Voices for Accessibility
by: Jain, Kunal, et al.
Published: (2025)

Quality Assessment of Noisy and Enhanced Speech with Limited Data: UWB-NTIS System for VoiceMOS 2024
by: Kunešová, Marie, et al.
Published: (2025)

EchoVoices: Preserving Generational Voices and Memories for Seniors and Children
by: Xu, Haiying, et al.
Published: (2025)

Turbocharge Speech Understanding with Pilot Inference
by: Wang, Rongxiang, et al.
Published: (2023)

Noise-Robust Hearing Aid Voice Control
by: López-Espejo, Iván, et al.
Published: (2024)

Human Voice is Unique
by: Singh, Rita, et al.
Published: (2025)

StreamVoice+: Evolving into End-to-end Streaming Zero-shot Voice Conversion
by: Wang, Zhichao, et al.
Published: (2024)

VoiceVector: Multimodal Enrolment Vectors for Speaker Separation
by: Rahimi, Akam, et al.
Published: (2025)

NPU-NTU System for Voice Privacy 2024 Challenge
by: Yao, Jixun, et al.
Published: (2024)

Voice Conversion Augmentation for Speaker Recognition on Defective Datasets
by: Tao, Ruijie, et al.
Published: (2024)

Robust Speech Activity Detection in the Presence of Singing Voice
by: Grundhuber, Philipp, et al.
Published: (2025)

Advancing Airport Tower Command Recognition: Integrating Squeeze-and-Excitation and Broadcasted Residual Learning
by: Lin, Yuanxi, et al.
Published: (2024)

Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
by: Seki, Kentaro, et al.
Published: (2024)

LatentVoiceGrad: Nonparallel Voice Conversion with Latent Diffusion/Flow-Matching Models
by: Kameoka, Hirokazu, et al.
Published: (2025)

VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
by: Kameoka, Hirokazu, et al.
Published: (2020)

Voice-ENHANCE: Speech Restoration using a Diffusion-based Voice Conversion Framework
by: Byun, Kyungguen, et al.
Published: (2025)

Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants
by: Sekkat, Chloé, et al.
Published: (2024)

Voice Mapping of Text-to-Speech Systems: A Metric-Based Approach for Voice Quality Assessment
by: Cai, Huanchen, et al.
Published: (2026)

SingIt! Singer Voice Transformation
by: Eliav, Amit, et al.
Published: (2024)

Controlling your Attributes in Voice
by: Li, Xuyuan, et al.
Published: (2025)

Objective Measurements of Voice Quality
by: Dhamyal, Hira, et al.
Published: (2024)

Interpreting Pretrained Speech Models for Automatic Speech Assessment of Voice Disorders
by: Lau, Hok-Shing, et al.
Published: (2024)

OneVoice: One Model, Triple Scenarios-Towards Unified Zero-shot Voice Conversion
by: Wang, Zhichao, et al.
Published: (2026)

Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model
by: Du, Zongyang, et al.
Published: (2024)

TidyVoice: A Curated Multilingual Dataset for Speaker Verification Derived from Common Voice
by: Farhadipour, Aref, et al.
Published: (2026)

Enhancing Polyglot Voices by Leveraging Cross-Lingual Fine-Tuning in Any-to-One Voice Conversion
by: Ruggiero, Giuseppe, et al.
Published: (2024)

First Steps Towards Voice Anonymization for Code-Switching Speech
by: Meyer, Sarina, et al.
Published: (2025)