Saved in:
| Main Authors: | Xuan, Xi, Carbone, Davide, Zhang, Wenxin, Pandey, Ruchi, Kinnunen, Tomi H. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02980 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
WaveSP-Net: Learnable Wavelet-Domain Sparse Prompt Tuning for Speech Deepfake Detection
by: Xuan, Xi, et al.
Published: (2025)
by: Xuan, Xi, et al.
Published: (2025)
Multilingual Source Tracing of Speech Deepfakes: A First Benchmark
by: Xuan, Xi, et al.
Published: (2025)
by: Xuan, Xi, et al.
Published: (2025)
Disentangling Speaker Traits for Deepfake Source Verification via Chebyshev Polynomial and Riemannian Metric Learning
by: Xuan, Xi, et al.
Published: (2026)
by: Xuan, Xi, et al.
Published: (2026)
Fake-Mamba: Real-Time Speech Deepfake Detection Using Bidirectional Mamba as Self-Attention's Alternative
by: Xuan, Xi, et al.
Published: (2025)
by: Xuan, Xi, et al.
Published: (2025)
Cyclostationarity Analysis as a Complement to Self-Supervised Representations for Speech Deepfake Detection
by: Hanilçi, Cemal, et al.
Published: (2026)
by: Hanilçi, Cemal, et al.
Published: (2026)
Advancing Zero-Shot Open-Set Speech Deepfake Source Tracing
by: Chhibber, Manasi, et al.
Published: (2025)
by: Chhibber, Manasi, et al.
Published: (2025)
Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes
by: Kukanov, Ivan, et al.
Published: (2024)
by: Kukanov, Ivan, et al.
Published: (2024)
Heart Murmur and Abnormal PCG Detection via Wavelet Scattering Transform & a 1D-CNN
by: Patwa, Ahmed, et al.
Published: (2023)
by: Patwa, Ahmed, et al.
Published: (2023)
Towards Explainable Spoofed Speech Attribution and Detection:a Probabilistic Approach for Characterizing Speech Synthesizer Components
by: Mishra, Jagabandhu, et al.
Published: (2025)
by: Mishra, Jagabandhu, et al.
Published: (2025)
Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation
by: Kim, Minsu, et al.
Published: (2023)
by: Kim, Minsu, et al.
Published: (2023)
Kinship Verification Using Voice
by: Mishra, Jagabandhu, et al.
Published: (2026)
by: Mishra, Jagabandhu, et al.
Published: (2026)
Benchmarking Audio Deepfake Detection Robustness in Real-world Communication Scenarios
by: Shi, Haohan, et al.
Published: (2025)
by: Shi, Haohan, et al.
Published: (2025)
Visual-Aware Speech Recognition for Noisy Scenarios
by: Balaji, Lakshmipathi, et al.
Published: (2025)
by: Balaji, Lakshmipathi, et al.
Published: (2025)
Reduce Computational Complexity for Continuous Wavelet Transform in Acoustic Recognition Using Hop Size
by: Phan, Dang Thoai
Published: (2024)
by: Phan, Dang Thoai
Published: (2024)
A Large-Scale Evaluation of Speech Foundation Models
by: Yang, Shu-wen, et al.
Published: (2024)
by: Yang, Shu-wen, et al.
Published: (2024)
TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch
by: Song, Xingchen, et al.
Published: (2024)
by: Song, Xingchen, et al.
Published: (2024)
Attentive Fusion: A Transformer-based Approach to Multimodal Hate Speech Detection
by: Mandal, Atanu, et al.
Published: (2024)
by: Mandal, Atanu, et al.
Published: (2024)
Causal Structure Discovery for Error Diagnostics of Children's ASR
by: Singh, Vishwanath Pratap, et al.
Published: (2025)
by: Singh, Vishwanath Pratap, et al.
Published: (2025)
Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications
by: Wills, Simone, et al.
Published: (2023)
by: Wills, Simone, et al.
Published: (2023)
Beyond Identity: A Generalizable Approach for Deepfake Audio Detection
by: Ahmadiadli, Yasaman, et al.
Published: (2025)
by: Ahmadiadli, Yasaman, et al.
Published: (2025)
RIR-Mega-Speech: A Reverberant Speech Corpus with Comprehensive Acoustic Metadata and Reproducible Evaluation
by: Goswami, Mandip
Published: (2026)
by: Goswami, Mandip
Published: (2026)
An Explainable Probabilistic Attribute Embedding Approach for Spoofed Speech Characterization
by: Chhibber, Manasi, et al.
Published: (2024)
by: Chhibber, Manasi, et al.
Published: (2024)
Impact of Microphone Array Mismatches to Learning-based Replay Speech Detection
by: Neri, Michael, et al.
Published: (2025)
by: Neri, Michael, et al.
Published: (2025)
Multi-channel Replay Speech Detection using an Adaptive Learnable Beamformer
by: Neri, Michael, et al.
Published: (2025)
by: Neri, Michael, et al.
Published: (2025)
Scattering Transform for Auditory Attention Decoding
by: Pallenberg, René, et al.
Published: (2026)
by: Pallenberg, René, et al.
Published: (2026)
Investigating the Potential of Multi-Stage Score Fusion in Spoofing-Aware Speaker Verification
by: Kurnaz, Oguzhan, et al.
Published: (2025)
by: Kurnaz, Oguzhan, et al.
Published: (2025)
SpeechMLC: Speech Multi-label Classification
by: Kim, Miseul, et al.
Published: (2025)
by: Kim, Miseul, et al.
Published: (2025)
Wavelet GPT: Wavelet Inspired Large Language Models
by: Verma, Prateek
Published: (2024)
by: Verma, Prateek
Published: (2024)
A SUPERB-Style Benchmark of Self-Supervised Speech Models for Audio Deepfake Detection
by: Ali, Hashim, et al.
Published: (2026)
by: Ali, Hashim, et al.
Published: (2026)
Detection of manatee vocalisations using the Audio Spectrogram Transformer
by: Schiappacasse, Stefano, et al.
Published: (2024)
by: Schiappacasse, Stefano, et al.
Published: (2024)
Speech-Declipping Transformer with Complex Spectrogram and Learnerble Temporal Features
by: Kwon, Younghoo, et al.
Published: (2024)
by: Kwon, Younghoo, et al.
Published: (2024)
Parameter-Efficient Fine-Tuning of Foundation Models for CLP Speech Classification
by: Bhattacharjee, Susmita, et al.
Published: (2025)
by: Bhattacharjee, Susmita, et al.
Published: (2025)
MultiGen: Child-Friendly Multilingual Speech Generator with LLMs
by: Gao, Xiaoxue, et al.
Published: (2025)
by: Gao, Xiaoxue, et al.
Published: (2025)
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation
by: Khurana, Sameer, et al.
Published: (2023)
by: Khurana, Sameer, et al.
Published: (2023)
A Speech Production Model for Radar: Connecting Speech Acoustics with Radar-Measured Vibrations
by: Lenz, Isabella, et al.
Published: (2025)
by: Lenz, Isabella, et al.
Published: (2025)
Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation
by: Berghi, Davide, et al.
Published: (2025)
by: Berghi, Davide, et al.
Published: (2025)
Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors
by: Singh, Vishwanath Pratap, et al.
Published: (2025)
by: Singh, Vishwanath Pratap, et al.
Published: (2025)
ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction
by: Yang, Shu-wen, et al.
Published: (2025)
by: Yang, Shu-wen, et al.
Published: (2025)
I Hear, Therefore I Trust: A Socio-Technical Investigation of Humans as Synthetic Speech Detectors
by: Erscoi, Lelia, et al.
Published: (2026)
by: Erscoi, Lelia, et al.
Published: (2026)
Binaural Localization Model for Speech in Noise
by: Tokala, Vikas, et al.
Published: (2025)
by: Tokala, Vikas, et al.
Published: (2025)
Similar Items
-
WaveSP-Net: Learnable Wavelet-Domain Sparse Prompt Tuning for Speech Deepfake Detection
by: Xuan, Xi, et al.
Published: (2025) -
Multilingual Source Tracing of Speech Deepfakes: A First Benchmark
by: Xuan, Xi, et al.
Published: (2025) -
Disentangling Speaker Traits for Deepfake Source Verification via Chebyshev Polynomial and Riemannian Metric Learning
by: Xuan, Xi, et al.
Published: (2026) -
Fake-Mamba: Real-Time Speech Deepfake Detection Using Bidirectional Mamba as Self-Attention's Alternative
by: Xuan, Xi, et al.
Published: (2025) -
Cyclostationarity Analysis as a Complement to Self-Supervised Representations for Speech Deepfake Detection
by: Hanilçi, Cemal, et al.
Published: (2026)