Saved in:
| Main Authors: | Xu, Xiaoran, Ra, In-Ho, Sankar, Ravi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.16845 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
by: Wu, Haibin, et al.
Published: (2021)
by: Wu, Haibin, et al.
Published: (2021)
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction
by: Brima, Yusuf, et al.
Published: (2023)
by: Brima, Yusuf, et al.
Published: (2023)
Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall Classification
by: Miyaguchi, Anthony, et al.
Published: (2023)
by: Miyaguchi, Anthony, et al.
Published: (2023)
Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models
by: Miara, Victor, et al.
Published: (2024)
by: Miara, Victor, et al.
Published: (2024)
BenSParX: A Robust Explainable Machine Learning Framework for Parkinson's Disease Detection from Bengali Conversational Speech
by: Hossain, Riad, et al.
Published: (2025)
by: Hossain, Riad, et al.
Published: (2025)
Reverse-Speech-Finder: A Neural Network Backtracking Architecture for Generating Alzheimer's Disease Speech Samples and Improving Diagnosis Performance
by: Li, Victor OK, et al.
Published: (2025)
by: Li, Victor OK, et al.
Published: (2025)
An AI-enabled Bias-Free Respiratory Disease Diagnosis Model using Cough Audio: A Case Study for COVID-19
by: Saeed, Tabish, et al.
Published: (2024)
by: Saeed, Tabish, et al.
Published: (2024)
Self-Supervised Learning for Speaker Recognition: A study and review
by: Lepage, Theo, et al.
Published: (2026)
by: Lepage, Theo, et al.
Published: (2026)
Singer Identity Representation Learning using Self-Supervised Techniques
by: Torres, Bernardo, et al.
Published: (2024)
by: Torres, Bernardo, et al.
Published: (2024)
Self-Supervised Learning for Few-Shot Bird Sound Classification
by: Moummad, Ilyass, et al.
Published: (2023)
by: Moummad, Ilyass, et al.
Published: (2023)
Self-Supervised Frameworks for Speaker Verification via Bootstrapped Positive Sampling
by: Lepage, Theo, et al.
Published: (2025)
by: Lepage, Theo, et al.
Published: (2025)
Enhancing Audio-Language Models through Self-Supervised Post-Training with Text-Audio Pairs
by: Sinha, Anshuman, et al.
Published: (2024)
by: Sinha, Anshuman, et al.
Published: (2024)
The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning
by: Vaessen, Nik, et al.
Published: (2024)
by: Vaessen, Nik, et al.
Published: (2024)
Voice-Driven Mortality Prediction in Hospitalized Heart Failure Patients: A Machine Learning Approach Enhanced with Diagnostic Biomarkers
by: Ahmadli, Nihat, et al.
Published: (2024)
by: Ahmadli, Nihat, et al.
Published: (2024)
Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning
by: Lepage, Théo, et al.
Published: (2022)
by: Lepage, Théo, et al.
Published: (2022)
MT-SLVR: Multi-Task Self-Supervised Learning for Transformation In(Variant) Representations
by: Heggan, Calum, et al.
Published: (2023)
by: Heggan, Calum, et al.
Published: (2023)
Additive Margin in Contrastive Self-Supervised Frameworks to Learn Discriminative Speaker Representations
by: Lepage, Theo, et al.
Published: (2024)
by: Lepage, Theo, et al.
Published: (2024)
Improving Perceptual Audio Aesthetic Assessment via Triplet Loss and Self-Supervised Embeddings
by: Wisnu, Dyah A. M. G., et al.
Published: (2025)
by: Wisnu, Dyah A. M. G., et al.
Published: (2025)
TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR
by: Ravi, Nagarathna, et al.
Published: (2024)
by: Ravi, Nagarathna, et al.
Published: (2024)
SCRAPL: Scattering Transform with Random Paths for Machine Learning
by: Mitcheltree, Christopher, et al.
Published: (2026)
by: Mitcheltree, Christopher, et al.
Published: (2026)
Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations
by: Kakoulidis, Panos, et al.
Published: (2024)
by: Kakoulidis, Panos, et al.
Published: (2024)
Voice Signal Processing for Machine Learning. The Case of Speaker Isolation
by: Ganchev, Radan
Published: (2024)
by: Ganchev, Radan
Published: (2024)
BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings
by: Charlot, Théo, et al.
Published: (2025)
by: Charlot, Théo, et al.
Published: (2025)
Windowed SummaryMixing: An Efficient Fine-Tuning of Self-Supervised Learning Models for Low-resource Speech Recognition
by: Menon, Aditya Srinivas, et al.
Published: (2026)
by: Menon, Aditya Srinivas, et al.
Published: (2026)
RCT: Random Consistency Training for Semi-supervised Sound Event Detection
by: Shao, Nian, et al.
Published: (2021)
by: Shao, Nian, et al.
Published: (2021)
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification
by: Sundar, Anirudh S., et al.
Published: (2023)
by: Sundar, Anirudh S., et al.
Published: (2023)
Machine Learning Approaches to Vocal Register Classification in Contemporary Male Pop Music
by: Kim, Alexander, et al.
Published: (2025)
by: Kim, Alexander, et al.
Published: (2025)
Active Restoration of Lost Audio Signals Using Machine Learning and Latent Information
by: Cheddad, Zohra Adila, et al.
Published: (2021)
by: Cheddad, Zohra Adila, et al.
Published: (2021)
Toward Fully Self-Supervised Multi-Pitch Estimation
by: Cwitkowitz, Frank, et al.
Published: (2024)
by: Cwitkowitz, Frank, et al.
Published: (2024)
Self-Supervised Embeddings for Detecting Individual Symptoms of Depression
by: Dumpala, Sri Harsha, et al.
Published: (2024)
by: Dumpala, Sri Harsha, et al.
Published: (2024)
Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses
by: Jeong, Seung Gyu, et al.
Published: (2025)
by: Jeong, Seung Gyu, et al.
Published: (2025)
Acoustic and Machine Learning Methods for Speech-Based Suicide Risk Assessment: A Systematic Review
by: Marie, Ambre, et al.
Published: (2025)
by: Marie, Ambre, et al.
Published: (2025)
An AI-Driven Approach to Wind Turbine Bearing Fault Diagnosis from Acoustic Signals
by: Wang, Zhao, et al.
Published: (2024)
by: Wang, Zhao, et al.
Published: (2024)
Investigating an Overfitting and Degeneration Phenomenon in Self-Supervised Multi-Pitch Estimation
by: Cwitkowitz, Frank, et al.
Published: (2025)
by: Cwitkowitz, Frank, et al.
Published: (2025)
On the Transferability of Large-Scale Self-Supervision to Few-Shot Audio Classification
by: Heggan, Calum, et al.
Published: (2024)
by: Heggan, Calum, et al.
Published: (2024)
Detecting Throat Cancer from Speech Signals using Machine Learning: A Scoping Literature Review
by: Paterson, Mary, et al.
Published: (2023)
by: Paterson, Mary, et al.
Published: (2023)
Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
by: Bai, Jisheng, et al.
Published: (2024)
by: Bai, Jisheng, et al.
Published: (2024)
Singing Voice Conversion with Accompaniment Using Self-Supervised Representation-Based Melody Features
by: Chen, Wei, et al.
Published: (2025)
by: Chen, Wei, et al.
Published: (2025)
Phoneme-Level Deepfake Detection Across Emotional Conditions Using Self-Supervised Embeddings
by: Nallaguntla, Vamshi, et al.
Published: (2026)
by: Nallaguntla, Vamshi, et al.
Published: (2026)
Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies
by: Anand, Srija, et al.
Published: (2024)
by: Anand, Srija, et al.
Published: (2024)
Similar Items
-
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
by: Wu, Haibin, et al.
Published: (2021) -
Understanding Self-Supervised Learning of Speech Representation via Invariance and Redundancy Reduction
by: Brima, Yusuf, et al.
Published: (2023) -
Transfer Learning with Semi-Supervised Dataset Annotation for Birdcall Classification
by: Miyaguchi, Anthony, et al.
Published: (2023) -
Towards Supervised Performance on Speaker Verification with Self-Supervised Learning by Leveraging Large-Scale ASR Models
by: Miara, Victor, et al.
Published: (2024) -
BenSParX: A Robust Explainable Machine Learning Framework for Parkinson's Disease Detection from Bengali Conversational Speech
by: Hossain, Riad, et al.
Published: (2025)