Saved in:
| Main Authors: | Myoung, Jisoo, Han, Sangwook, Kim, Kihyuk, Shin, Jong Won |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.19721 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Speech Enhancement based on cascaded two flows
by: Lee, Seonggyu, et al.
Published: (2025)
by: Lee, Seonggyu, et al.
Published: (2025)
SV-Mixer: Replacing the Transformer Encoder with Lightweight MLPs for Self-Supervised Model Compression in Speaker Verification
by: Heo, Jungwoo, et al.
Published: (2025)
by: Heo, Jungwoo, et al.
Published: (2025)
FlowSE: Flow Matching-based Speech Enhancement
by: Lee, Seonggyu, et al.
Published: (2025)
by: Lee, Seonggyu, et al.
Published: (2025)
Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models
by: Kim, Jin Sob, et al.
Published: (2024)
by: Kim, Jin Sob, et al.
Published: (2024)
Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification
by: Sang, Mufan, et al.
Published: (2024)
by: Sang, Mufan, et al.
Published: (2024)
Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems
by: Chen, Zhengyang, et al.
Published: (2024)
by: Chen, Zhengyang, et al.
Published: (2024)
Speaker Disentanglement of Speech Pre-trained Model Based on Interpretability
by: Zhu, Xiaoxu, et al.
Published: (2025)
by: Zhu, Xiaoxu, et al.
Published: (2025)
Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment
by: Shao, Yiwen, et al.
Published: (2024)
by: Shao, Yiwen, et al.
Published: (2024)
Asymmetric Clean Segments-Guided Self-Supervised Learning for Robust Speaker Verification
by: Gan, Chong-Xin, et al.
Published: (2023)
by: Gan, Chong-Xin, et al.
Published: (2023)
NeXt-TDNN: Modernizing Multi-Scale Temporal Convolution Backbone for Speaker Verification
by: Heo, Hyun-Jun, et al.
Published: (2023)
by: Heo, Hyun-Jun, et al.
Published: (2023)
Speaker-Conditioned Phrase Break Prediction for Text-to-Speech with Phoneme-Level Pre-trained Language Model
by: Yang, Dong, et al.
Published: (2025)
by: Yang, Dong, et al.
Published: (2025)
SA-WavLM: Speaker-Aware Self-Supervised Pre-training for Mixture Speech
by: Lin, Jingru, et al.
Published: (2024)
by: Lin, Jingru, et al.
Published: (2024)
FUN-SSL: Full-band Layer Followed by U-Net with Narrow-band Layers for Multiple Moving Sound Source Localization
by: Choi, Yuseon, et al.
Published: (2025)
by: Choi, Yuseon, et al.
Published: (2025)
Joint Optimization of Speaker and Spoof Detectors for Spoofing-Robust Automatic Speaker Verification
by: Kurnaz, Oğuzhan, et al.
Published: (2025)
by: Kurnaz, Oğuzhan, et al.
Published: (2025)
Learning Emotion-Invariant Speaker Representations for Speaker Verification
by: Tian, Jingguang, et al.
Published: (2025)
by: Tian, Jingguang, et al.
Published: (2025)
Investigating the Potential of Multi-Stage Score Fusion in Spoofing-Aware Speaker Verification
by: Kurnaz, Oguzhan, et al.
Published: (2025)
by: Kurnaz, Oguzhan, et al.
Published: (2025)
An Age-Agnostic System for Robust Speaker Verification
by: Zheng, Jiusi, et al.
Published: (2025)
by: Zheng, Jiusi, et al.
Published: (2025)
Hybrid Pruning: In-Situ Compression of Self-Supervised Speech Models for Speaker Verification and Anti-Spoofing
by: Peng, Junyi, et al.
Published: (2025)
by: Peng, Junyi, et al.
Published: (2025)
ERes2NetV2: Boosting Short-Duration Speaker Verification Performance with Computational Efficiency
by: Chen, Yafeng, et al.
Published: (2024)
by: Chen, Yafeng, et al.
Published: (2024)
UniPET-SPK: A Unified Framework for Parameter-Efficient Tuning of Pre-trained Speech Models for Robust Speaker Verification
by: Sang, Mufan, et al.
Published: (2025)
by: Sang, Mufan, et al.
Published: (2025)
Integrated Multi-Level Knowledge Distillation for Enhanced Speaker Verification
by: Yang, Wenhao, et al.
Published: (2024)
by: Yang, Wenhao, et al.
Published: (2024)
Effective Modeling of Critical Contextual Information for TDNN-based Speaker Verification
by: Weng, Shilong, et al.
Published: (2025)
by: Weng, Shilong, et al.
Published: (2025)
Towards Unsupervised Speaker Diarization System for Multilingual Telephone Calls Using Pre-trained Whisper Model and Mixture of Sparse Autoencoders
by: Lam, Phat, et al.
Published: (2024)
by: Lam, Phat, et al.
Published: (2024)
Spoofing-Aware Speaker Verification via Wavelet Prompt Tuning and Multi-Model Ensembles
by: Farhadipour, Aref, et al.
Published: (2026)
by: Farhadipour, Aref, et al.
Published: (2026)
An Adaptive X-vector Model for Text-independent Speaker Verification
by: Gu, Bin, et al.
Published: (2020)
by: Gu, Bin, et al.
Published: (2020)
Trainable Adaptive Score Normalization for Automatic Speaker Verification
by: Choi, Jeong-Hwan, et al.
Published: (2025)
by: Choi, Jeong-Hwan, et al.
Published: (2025)
Optimizing a-DCF for Spoofing-Robust Speaker Verification
by: Kurnaz, Oğuzhan, et al.
Published: (2024)
by: Kurnaz, Oğuzhan, et al.
Published: (2024)
LG Uplus System with Multi-Speaker IDs and Discriminator-based Sub-Judges for the WildSpoof Challenge
by: Park, Jinyoung, et al.
Published: (2025)
by: Park, Jinyoung, et al.
Published: (2025)
Adversarial Reweighting for Speaker Verification Fairness
by: Jin, Minho, et al.
Published: (2022)
by: Jin, Minho, et al.
Published: (2022)
Prototype and Instance Contrastive Learning for Unsupervised Domain Adaptation in Speaker Verification
by: Huang, Wen, et al.
Published: (2024)
by: Huang, Wen, et al.
Published: (2024)
Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation
by: Shin, Ui-Hyeop, et al.
Published: (2024)
by: Shin, Ui-Hyeop, et al.
Published: (2024)
Unifying Diarization, Separation, and ASR with Multi-Speaker Encoder
by: Shakeel, Muhammad, et al.
Published: (2025)
by: Shakeel, Muhammad, et al.
Published: (2025)
Whisper-PMFA: Partial Multi-Scale Feature Aggregation for Speaker Verification using Whisper Models
by: Zhao, Yiyang, et al.
Published: (2024)
by: Zhao, Yiyang, et al.
Published: (2024)
Bayesian Learning for Domain-Invariant Speaker Verification and Anti-Spoofing
by: Li, Jin, et al.
Published: (2025)
by: Li, Jin, et al.
Published: (2025)
DAME: Duration-Aware Matryoshka Embedding for Duration-Robust Speaker Verification
by: Jung, Youngmoon, et al.
Published: (2026)
by: Jung, Youngmoon, et al.
Published: (2026)
3D-Speaker-Toolkit: An Open-Source Toolkit for Multimodal Speaker Verification and Diarization
by: Chen, Yafeng, et al.
Published: (2024)
by: Chen, Yafeng, et al.
Published: (2024)
Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification
by: Truong, Duc-Tuan, et al.
Published: (2023)
by: Truong, Duc-Tuan, et al.
Published: (2023)
Towards Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training
by: Yang, Yifan, et al.
Published: (2026)
by: Yang, Yifan, et al.
Published: (2026)
Diffusion-Based Adversarial Purification for Speaker Verification
by: Bai, Yibo, et al.
Published: (2023)
by: Bai, Yibo, et al.
Published: (2023)
PhiNet: Speaker Verification with Phonetic Interpretability
by: Ma, Yi, et al.
Published: (2026)
by: Ma, Yi, et al.
Published: (2026)
Similar Items
-
Speech Enhancement based on cascaded two flows
by: Lee, Seonggyu, et al.
Published: (2025) -
SV-Mixer: Replacing the Transformer Encoder with Lightweight MLPs for Self-Supervised Model Compression in Speaker Verification
by: Heo, Jungwoo, et al.
Published: (2025) -
FlowSE: Flow Matching-based Speech Enhancement
by: Lee, Seonggyu, et al.
Published: (2025) -
Layer-aware TDNN: Speaker Recognition Using Multi-Layer Features from Pre-Trained Models
by: Kim, Jin Sob, et al.
Published: (2024) -
Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification
by: Sang, Mufan, et al.
Published: (2024)