:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rahman, Md. Abdur, Thuseethan, Selvarajah, Yeo, Kheng Cher, Mohamed, Reem E., Azam, Sami
Format:	Preprint
Published:	2025
Subjects:	Sound
Online Access:	https://arxiv.org/abs/2510.00522
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

An Innovative Coverage Path Planning Approach for UAVs to Boost Precision Agriculture and Rescue Operations
by: Nur Mohammad Fahad, et al.
Published: (2025)

BioAutoML-NAS: An End-to-End AutoML Framework for Multimodal Insect Classification via Neural Architecture Search on Large-Scale Biodiversity Data
by: Abian, Arefin Ittesafun, et al.
Published: (2025)

DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection
by: Zaman, Sayeem Been, et al.
Published: (2025)

Predicting Postresection Colorectal Liver Metastases Recurrence Using Advanced Graph Neural Networks with Explainability and Causal Inference
by: Jubair Ahmed, et al.
Published: (2025)

Learning to Weigh Waste: A Physics-Informed Multimodal Fusion Framework and Large-Scale Dataset for Commercial and Industrial Applications
by: Islam, Md. Adnanul, et al.
Published: (2026)

Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification
by: Cai, Yiqiang, et al.
Published: (2024)

A Source-Free Approach for Domain Adaptation via Multiview Image Transformation and Latent Space Consistency
by: Sutradhar, Debopom, et al.
Published: (2026)

Masked Latent Prediction and Classification for Self-Supervised Audio Representation Learning
by: Quelennec, Aurian, et al.
Published: (2025)

From Birdsong to Rumbles: Classifying Elephant Calls with Out-of-Species Embeddings
by: Geldenhuys, Christiaan M., et al.
Published: (2026)

Deepfake Audio Detection Using Self-supervised Fusion Representations
by: Zaman, Khalid, et al.
Published: (2026)

WeCKD: Weakly-supervised Chained Distillation Network for Efficient Multimodal Medical Imaging
by: Rahman, Md. Abdur, et al.
Published: (2025)

Self-supervised Learning for Acoustic Few-Shot Classification
by: Liang, Jingyong, et al.
Published: (2024)

Self-supervised Reflective Learning through Self-distillation and Online Clustering for Speaker Representation Learning
by: Cai, Danwei, et al.
Published: (2024)

Self-supervised Multimodal Speech Representations for the Assessment of Schizophrenia Symptoms
by: Premananth, Gowtham, et al.
Published: (2024)

Implicit Self-supervised Language Representation for Spoken Language Diarization
by: Mishra, Jagabandhu, et al.
Published: (2023)

[b]=[d]-[t]+[p]: Self-supervised Speech Models Discover Phonological Vector Arithmetic
by: Choi, Kwanghee, et al.
Published: (2026)

Voice Biomarker Analysis and Automated Severity Classification of Dysarthric Speech in a Multilingual Context
by: Yeo, Eunjung
Published: (2024)

SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations
by: Meghanani, Amit, et al.
Published: (2024)

Prosodic ABX: A Language-Agnostic Method for Measuring Prosodic Contrast in Speech Representations
by: Sun, Haitong, et al.
Published: (2026)

Optimized Self-supervised Training with BEST-RQ for Speech Recognition
by: Baumann, Ilja, et al.
Published: (2025)

Self-supervised Speech Representations Still Struggle with African American Vernacular English
by: Chang, Kalvin, et al.
Published: (2024)

SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition
by: Xue, Hongfei, et al.
Published: (2023)

SS-DPPN: A self-supervised dual-path foundation model for the generalizable cardiac audio representation
by: Muna, Ummy Maria, et al.
Published: (2025)

Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features
by: Tsunoo, Emiru, et al.
Published: (2024)

Learning Domain-Robust Bioacoustic Representations for Mosquito Species Classification with Contrastive Learning and Distribution Alignment
by: Hou, Yuanbo, et al.
Published: (2025)

Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation
by: Wang, Wupeng, et al.
Published: (2025)

SoundPlot: An Open-Source Framework for Birdsong Acoustic Analysis and Neural Synthesis with Interactive 3D Visualization
by: Mehdi, Naqcho Ali, et al.
Published: (2026)

STONE: Self-supervised Tonality Estimator
by: Kong, Yuexuan, et al.
Published: (2024)

LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks
by: Meghanani, Amit, et al.
Published: (2024)

Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective
by: Liu, Alexander H., et al.
Published: (2024)

Position-invariant Fine-tuning of Speech Enhancement Models with Self-supervised Speech Representations
by: Meghanani, Amit, et al.
Published: (2026)

Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations
by: Meghanani, Amit, et al.
Published: (2024)

Less Forgetting for Better Generalization: Exploring Continual-learning Fine-tuning Methods for Speech Self-supervised Representations
by: Zaiem, Salah, et al.
Published: (2024)

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
by: Gong, Cheng, et al.
Published: (2023)

Selection of Layers from Self-supervised Learning Models for Predicting Mean-Opinion-Score of Speech
by: Liang, Xinyu, et al.
Published: (2025)

The Effect of Batch Size on Contrastive Self-Supervised Speech Representation Learning
by: Vaessen, Nik, et al.
Published: (2024)

Multi-Class-Token Transformer for Multitask Self-supervised Music Information Retrieval
by: Kong, Yuexuan, et al.
Published: (2025)

MMM: Multi-Layer Multi-Residual Multi-Stream Discrete Speech Representation from Self-supervised Learning Model
by: Shi, Jiatong, et al.
Published: (2024)

Boosting Multi-Speaker Expressive Speech Synthesis with Semi-supervised Contrastive Learning
by: Zhu, Xinfa, et al.
Published: (2023)

oboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models
by: Dip, Muhammad Sudipto Siam, et al.
Published: (2024)