:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xuan, Xi, Carbone, Davide, Zhang, Wenxin, Pandey, Ruchi, Kinnunen, Tomi H.
Format:	Preprint
Published:	2026
Subjects:	Audio and Speech Processing Computation and Language Signal Processing
Online Access:	https://arxiv.org/abs/2602.02980
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

WaveSP-Net: Learnable Wavelet-Domain Sparse Prompt Tuning for Speech Deepfake Detection
by: Xuan, Xi, et al.
Published: (2025)

Multilingual Source Tracing of Speech Deepfakes: A First Benchmark
by: Xuan, Xi, et al.
Published: (2025)

Disentangling Speaker Traits for Deepfake Source Verification via Chebyshev Polynomial and Riemannian Metric Learning
by: Xuan, Xi, et al.
Published: (2026)

Fake-Mamba: Real-Time Speech Deepfake Detection Using Bidirectional Mamba as Self-Attention's Alternative
by: Xuan, Xi, et al.
Published: (2025)

Cyclostationarity Analysis as a Complement to Self-Supervised Representations for Speech Deepfake Detection
by: Hanilçi, Cemal, et al.
Published: (2026)

Advancing Zero-Shot Open-Set Speech Deepfake Source Tracing
by: Chhibber, Manasi, et al.
Published: (2025)

Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes
by: Kukanov, Ivan, et al.
Published: (2024)

Heart Murmur and Abnormal PCG Detection via Wavelet Scattering Transform & a 1D-CNN
by: Patwa, Ahmed, et al.
Published: (2023)

Towards Explainable Spoofed Speech Attribution and Detection:a Probabilistic Approach for Characterizing Speech Synthesizer Components
by: Mishra, Jagabandhu, et al.
Published: (2025)

Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
by: Kim, Minsu, et al.
Published: (2023)

Kinship Verification Using Voice
by: Mishra, Jagabandhu, et al.
Published: (2026)

Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios
by: Shi, Haohan, et al.
Published: (2025)

Visual-Aware Speech Recognition for Noisy Scenarios
by: Balaji, Lakshmipathi, et al.
Published: (2025)

Reduce Computational Complexity for Continuous Wavelet Transform in Acoustic Recognition Using Hop Size
by: Phan, Dang Thoai
Published: (2024)

A Large-Scale Evaluation of Speech Foundation Models
by: Yang, Shu-wen, et al.
Published: (2024)

TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch
by: Song, Xingchen, et al.
Published: (2024)

Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection
by: Mandal, Atanu, et al.
Published: (2024)

Causal Structure Discovery for Error Diagnostics of Children's ASR
by: Singh, Vishwanath Pratap, et al.
Published: (2025)

Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications
by: Wills, Simone, et al.
Published: (2023)

Beyond Identity: A Generalizable Approach for Deepfake Audio Detection
by: Ahmadiadli, Yasaman, et al.
Published: (2025)

RIR-Mega-Speech: A Reverberant Speech Corpus with Comprehensive Acoustic Metadata and Reproducible Evaluation
by: Goswami, Mandip
Published: (2026)

An Explainable Probabilistic Attribute Embedding Approach for Spoofed Speech Characterization
by: Chhibber, Manasi, et al.
Published: (2024)

Impact of Microphone Array Mismatches to Learning-based Replay Speech Detection
by: Neri, Michael, et al.
Published: (2025)

Multi-channel Replay Speech Detection using an Adaptive Learnable Beamformer
by: Neri, Michael, et al.
Published: (2025)

Scattering Transform for Auditory Attention Decoding
by: Pallenberg, René, et al.
Published: (2026)

Investigating the Potential of Multi-Stage Score Fusion in Spoofing-Aware Speaker Verification
by: Kurnaz, Oguzhan, et al.
Published: (2025)

SpeechMLC: Speech Multi-label Classification
by: Kim, Miseul, et al.
Published: (2025)

Wavelet GPT: Wavelet Inspired Large Language Models
by: Verma, Prateek
Published: (2024)

A SUPERB-Style Benchmark of Self-Supervised Speech Models for Audio Deepfake Detection
by: Ali, Hashim, et al.
Published: (2026)

Detection of manatee vocalisations using the Audio Spectrogram Transformer
by: Schiappacasse, Stefano, et al.
Published: (2024)

Speech-Declipping Transformer with Complex Spectrogram and Learnerble Temporal Features
by: Kwon, Younghoo, et al.
Published: (2024)

Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification
by: Bhattacharjee, Susmita, et al.
Published: (2025)

MultiGen: Child-Friendly Multilingual Speech Generator with LLMs
by: Gao, Xiaoxue, et al.
Published: (2025)

Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
by: Khurana, Sameer, et al.
Published: (2023)

A Speech Production Model for Radar: Connecting Speech Acoustics with Radar-Measured Vibrations
by: Lenz, Isabella, et al.
Published: (2025)

Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation
by: Berghi, Davide, et al.
Published: (2025)

Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors
by: Singh, Vishwanath Pratap, et al.
Published: (2025)

ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction
by: Yang, Shu-wen, et al.
Published: (2025)

I Hear, Therefore I Trust: A Socio-Technical Investigation of Humans as Synthetic Speech Detectors
by: Erscoi, Lelia, et al.
Published: (2026)

Binaural Localization Model for Speech in Noise
by: Tokala, Vikas, et al.
Published: (2025)