:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bhuyan, Amit Kumar, Dutta, Hrishikesh, Biswas, Subir
Format:	Preprint
Published:	2024
Subjects:	Sound Machine Learning Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2404.10842
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models
by: Wang, Quan, et al.
Published: (2024)

Investigating Confidence Estimation Measures for Speaker Diarization
by: Chowdhury, Anurag, et al.
Published: (2024)

Multi-Stage Speaker Diarization for Noisy Classrooms
by: Khan, Ali Sartaz, et al.
Published: (2025)

Language Modelling for Speaker Diarization in Telephonic Interviews
by: India, Miquel, et al.
Published: (2025)

MK-SGC-SC: Multiple Kernel Guided Sparse Graph Construction in Spectral Clustering for Unsupervised Speaker Diarization
by: Raghav, Nikhil, et al.
Published: (2026)

Self-Tuning Spectral Clustering for Speaker Diarization
by: Raghav, Nikhil, et al.
Published: (2024)

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
by: Wang, Quan, et al.
Published: (2022)

Speech Diarization and ASR with GMM
by: Sharma, Aayush Kumar, et al.
Published: (2023)

Zero Shot Audio to Audio Emotion Transfer With Speaker Disentanglement
by: Dutta, Soumya, et al.
Published: (2024)

Leveraging Self-Supervised Learning for Speaker Diarization
by: Han, Jiangyu, et al.
Published: (2024)

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
by: Lin, Weiwei, et al.
Published: (2024)

Pretraining Multi-Speaker Identification for Neural Speaker Diarization
by: Horiguchi, Shota, et al.
Published: (2025)

USED: Universal Speaker Extraction and Diarization
by: Ao, Junyi, et al.
Published: (2023)

Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation
by: Lu, Cheng, et al.
Published: (2024)

Assessing the Robustness of Spectral Clustering for Deep Speaker Diarization
by: Raghav, Nikhil, et al.
Published: (2024)

Uncertainty Quantification in Machine Learning for Joint Speaker Diarization and Identification
by: McKnight, Simon W., et al.
Published: (2023)

Mamba-based Segmentation Model for Speaker Diarization
by: Plaquet, Alexis, et al.
Published: (2024)

Speaker Embeddings With Weakly Supervised Voice Activity Detection For Efficient Speaker Diarization
by: Thienpondt, Jenthe, et al.
Published: (2024)

Can We Really Repurpose Multi-Speaker ASR Corpus for Speaker Diarization?
by: Horiguchi, Shota, et al.
Published: (2025)

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization
by: Cheng, Luyao, et al.
Published: (2024)

Adversarial Speaker Distillation for Countermeasure Model on Automatic Speaker Verification
by: Liao, Yen-Lun, et al.
Published: (2022)

Streaming Sortformer: Speaker Cache-Based Online Speaker Diarization with Arrival-Time Ordering
by: Medennikov, Ivan, et al.
Published: (2025)

HiddenSpeaker: Generate Imperceptible Unlearnable Audios for Speaker Verification System
by: Zhang, Zhisheng, et al.
Published: (2024)

Voice Signal Processing for Machine Learning. The Case of Speaker Isolation
by: Ganchev, Radan
Published: (2024)

Multiple Choice Learning for Efficient Speech Separation with Many Speakers
by: Perera, David, et al.
Published: (2024)

TinySV: Speaker Verification in TinyML with On-device Learning
by: Pavan, Massimo, et al.
Published: (2024)

Robust Channel Learning for Large-Scale Radio Speaker Verification
by: Yang, Wenhao, et al.
Published: (2024)

Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
by: Wu, Haibin, et al.
Published: (2021)

Self-Supervised Learning for Speaker Recognition: A study and review
by: Lepage, Theo, et al.
Published: (2026)

Bengali-Loop: Community Benchmarks for Long-Form Bangla ASR and Speaker Diarization
by: Tabib, H. M. Shadman, et al.
Published: (2026)

End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations
by: Morrone, Giovanni, et al.
Published: (2023)

Multi-stream Convolutional Neural Network with Frequency Selection for Robust Speaker Verification
by: Yao, Wei, et al.
Published: (2020)

LMD: A Learnable Mask Network to Detect Adversarial Examples for Speaker Verification
by: Chen, Xing, et al.
Published: (2022)

A Semi-Automatic Approach to Create Large Gender- and Age-Balanced Speaker Corpora: Usefulness of Speaker Diarization & Identification
by: Uro, Rémi, et al.
Published: (2024)

Text-Independent Speaker Identification Using Audio Looping With Margin Based Loss Functions
by: Garcia, Elliot Q C, et al.
Published: (2025)

Joint Training of Speaker Embedding Extractor, Speech and Overlap Detection for Diarization
by: Pálka, Petr, et al.
Published: (2024)

Additive Margin in Contrastive Self-Supervised Frameworks to Learn Discriminative Speaker Representations
by: Lepage, Theo, et al.
Published: (2024)

TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
by: Boeddeker, Christoph, et al.
Published: (2023)

Label-Efficient Self-Supervised Speaker Verification With Information Maximization and Contrastive Learning
by: Lepage, Théo, et al.
Published: (2022)

Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM
by: Sun, Zhaokai, et al.
Published: (2025)