Saved in:
| Main Authors: | Rahman, Md. Abdur, Thuseethan, Selvarajah, Yeo, Kheng Cher, Mohamed, Reem E., Azam, Sami |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.00522 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
An Innovative Coverage Path Planning Approach for UAVs to Boost Precision Agriculture and Rescue Operations
by: Nur Mohammad Fahad, et al.
Published: (2025)
by: Nur Mohammad Fahad, et al.
Published: (2025)
BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data
by: Abian, Arefin Ittesafun, et al.
Published: (2025)
by: Abian, Arefin Ittesafun, et al.
Published: (2025)
DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
by: Zaman, Sayeem Been, et al.
Published: (2025)
by: Zaman, Sayeem Been, et al.
Published: (2025)
Predicting Postresection Colorectal Liver Metastases Recurrence Using Advanced Graph Neural Networks with Explainability and Causal Inference
by: Jubair Ahmed, et al.
Published: (2025)
by: Jubair Ahmed, et al.
Published: (2025)
Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications
by: Islam, Md. Adnanul, et al.
Published: (2026)
by: Islam, Md. Adnanul, et al.
Published: (2026)
Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification
by: Cai, Yiqiang, et al.
Published: (2024)
by: Cai, Yiqiang, et al.
Published: (2024)
A Source-Free Approach for Domain Adaptation via Multiview Image Transformation and Latent Space Consistency
by: Sutradhar, Debopom, et al.
Published: (2026)
by: Sutradhar, Debopom, et al.
Published: (2026)
Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning
by: Quelennec, Aurian, et al.
Published: (2025)
by: Quelennec, Aurian, et al.
Published: (2025)
From Birdsong to Rumbles: Classifying Elephant Calls with Out-of-Species Embeddings
by: Geldenhuys, Christiaan M., et al.
Published: (2026)
by: Geldenhuys, Christiaan M., et al.
Published: (2026)
Deepfake Audio Detection Using Self-supervised Fusion Representations
by: Zaman, Khalid, et al.
Published: (2026)
by: Zaman, Khalid, et al.
Published: (2026)
WeCKD: Weakly-supervised Chained Distillation Network for Efficient Multimodal Medical Imaging
by: Rahman, Md. Abdur, et al.
Published: (2025)
by: Rahman, Md. Abdur, et al.
Published: (2025)
Self-supervised Learning for Acoustic Few-Shot Classification
by: Liang, Jingyong, et al.
Published: (2024)
by: Liang, Jingyong, et al.
Published: (2024)
Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning
by: Cai, Danwei, et al.
Published: (2024)
by: Cai, Danwei, et al.
Published: (2024)
Self-supervised Multimodal Speech Representations for the Assessment of Schizophrenia Symptoms
by: Premananth, Gowtham, et al.
Published: (2024)
by: Premananth, Gowtham, et al.
Published: (2024)
Implicit Self-supervised Language Representation for Spoken Language Diarization
by: Mishra, Jagabandhu, et al.
Published: (2023)
by: Mishra, Jagabandhu, et al.
Published: (2023)
[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic
by: Choi, Kwanghee, et al.
Published: (2026)
by: Choi, Kwanghee, et al.
Published: (2026)
Voice Biomarker Analysis and Automated Severity Classification of Dysarthric Speech in a Multilingual Context
by: Yeo, Eunjung
Published: (2024)
by: Yeo, Eunjung
Published: (2024)
SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations
by: Meghanani, Amit, et al.
Published: (2024)
by: Meghanani, Amit, et al.
Published: (2024)
Prosodic ABX: A Language-Agnostic Method for Measuring Prosodic Contrast in Speech Representations
by: Sun, Haitong, et al.
Published: (2026)
by: Sun, Haitong, et al.
Published: (2026)
Optimized Self-supervised Training with BEST-RQ for Speech Recognition
by: Baumann, Ilja, et al.
Published: (2025)
by: Baumann, Ilja, et al.
Published: (2025)
Self-supervised Speech Representations Still Struggle with African American Vernacular English
by: Chang, Kalvin, et al.
Published: (2024)
by: Chang, Kalvin, et al.
Published: (2024)
SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition
by: Xue, Hongfei, et al.
Published: (2023)
by: Xue, Hongfei, et al.
Published: (2023)
SS-DPPN: A self-supervised dual-path foundation model for the generalizable cardiac audio representation
by: Muna, Ummy Maria, et al.
Published: (2025)
by: Muna, Ummy Maria, et al.
Published: (2025)
Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features
by: Tsunoo, Emiru, et al.
Published: (2024)
by: Tsunoo, Emiru, et al.
Published: (2024)
Learning Domain-Robust Bioacoustic Representations for Mosquito Species Classification with Contrastive Learning and Distribution Alignment
by: Hou, Yuanbo, et al.
Published: (2025)
by: Hou, Yuanbo, et al.
Published: (2025)
Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation
by: Wang, Wupeng, et al.
Published: (2025)
by: Wang, Wupeng, et al.
Published: (2025)
SoundPlot: An Open-Source Framework for Birdsong Acoustic Analysis and Neural Synthesis with Interactive 3D Visualization
by: Mehdi, Naqcho Ali, et al.
Published: (2026)
by: Mehdi, Naqcho Ali, et al.
Published: (2026)
STONE: Self-supervised Tonality Estimator
by: Kong, Yuexuan, et al.
Published: (2024)
by: Kong, Yuexuan, et al.
Published: (2024)
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks
by: Meghanani, Amit, et al.
Published: (2024)
by: Meghanani, Amit, et al.
Published: (2024)
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective
by: Liu, Alexander H., et al.
Published: (2024)
by: Liu, Alexander H., et al.
Published: (2024)
Position-invariant Fine-tuning of Speech Enhancement Models with Self-supervised Speech Representations
by: Meghanani, Amit, et al.
Published: (2026)
by: Meghanani, Amit, et al.
Published: (2026)
Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations
by: Meghanani, Amit, et al.
Published: (2024)
by: Meghanani, Amit, et al.
Published: (2024)
Less Forgetting for Better Generalization: Exploring Continual-learning Fine-tuning Methods for Speech Self-supervised Representations
by: Zaiem, Salah, et al.
Published: (2024)
by: Zaiem, Salah, et al.
Published: (2024)
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
by: Gong, Cheng, et al.
Published: (2023)
by: Gong, Cheng, et al.
Published: (2023)
Selection of Layers from Self-supervised Learning Models for Predicting Mean-Opinion-Score of Speech
by: Liang, Xinyu, et al.
Published: (2025)
by: Liang, Xinyu, et al.
Published: (2025)
The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning
by: Vaessen, Nik, et al.
Published: (2024)
by: Vaessen, Nik, et al.
Published: (2024)
Multi-Class-Token Transformer for Multitask Self-supervised Music Information Retrieval
by: Kong, Yuexuan, et al.
Published: (2025)
by: Kong, Yuexuan, et al.
Published: (2025)
MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model
by: Shi, Jiatong, et al.
Published: (2024)
by: Shi, Jiatong, et al.
Published: (2024)
Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning
by: Zhu, Xinfa, et al.
Published: (2023)
by: Zhu, Xinfa, et al.
Published: (2023)
oboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models
by: Dip, Muhammad Sudipto Siam, et al.
Published: (2024)
by: Dip, Muhammad Sudipto Siam, et al.
Published: (2024)
Similar Items
-
An Innovative Coverage Path Planning Approach for UAVs to Boost Precision Agriculture and Rescue Operations
by: Nur Mohammad Fahad, et al.
Published: (2025) -
BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data
by: Abian, Arefin Ittesafun, et al.
Published: (2025) -
DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
by: Zaman, Sayeem Been, et al.
Published: (2025) -
Predicting Postresection Colorectal Liver Metastases Recurrence Using Advanced Graph Neural Networks with Explainability and Causal Inference
by: Jubair Ahmed, et al.
Published: (2025) -
Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications
by: Islam, Md. Adnanul, et al.
Published: (2026)