Saved in:
| Main Authors: | Ai, Zhiqi, Cheng, Han, Wang, Yuxin, Mu, Shiyi, Xu, Shugong, Zhou, Yongjin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.10740 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Effective User-defined Keyword Spotting with Dual-stage Matching, Multi-modal Enrollment, and Continual Adaptation
by: Ai, Zhiqi, et al.
Published: (2026)
by: Ai, Zhiqi, et al.
Published: (2026)
MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting
by: Ai, Zhiqi, et al.
Published: (2024)
by: Ai, Zhiqi, et al.
Published: (2024)
Phoneme-Level Contrastive Learning for User-Defined Keyword Spotting with Flexible Enrollment
by: Kewei, Li, et al.
Published: (2024)
by: Kewei, Li, et al.
Published: (2024)
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample
by: Chen, Zhiyong, et al.
Published: (2024)
by: Chen, Zhiyong, et al.
Published: (2024)
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis
by: Chen, Zhiyong, et al.
Published: (2024)
by: Chen, Zhiyong, et al.
Published: (2024)
End-to-End User-Defined Keyword Spotting using Shifted Delta Coefficients
by: V, Kesavaraj, et al.
Published: (2024)
by: V, Kesavaraj, et al.
Published: (2024)
VoxAging: Continuously Tracking Speaker Aging with a Large-Scale Longitudinal Dataset in English and Mandarin
by: Ai, Zhiqi, et al.
Published: (2025)
by: Ai, Zhiqi, et al.
Published: (2025)
Robust Dual-Modal Speech Keyword Spotting for XR Headsets
by: Cai, Zhuojiang, et al.
Published: (2024)
by: Cai, Zhuojiang, et al.
Published: (2024)
NTC-KWS: Noise-aware CTC for Robust Keyword Spotting
by: Xi, Yu, et al.
Published: (2024)
by: Xi, Yu, et al.
Published: (2024)
AdaKWS: Towards Robust Keyword Spotting with Test-Time Adaptation
by: Xiao, Yang, et al.
Published: (2025)
by: Xiao, Yang, et al.
Published: (2025)
Effective Integration of KAN for Keyword Spotting
by: Xu, Anfeng, et al.
Published: (2024)
by: Xu, Anfeng, et al.
Published: (2024)
Few-Shot Keyword Spotting from Mixed Speech
by: Yuan, Junming, et al.
Published: (2024)
by: Yuan, Junming, et al.
Published: (2024)
Keyword Mamba: Spoken Keyword Spotting with State Space Models
by: Ding, Hanyu, et al.
Published: (2025)
by: Ding, Hanyu, et al.
Published: (2025)
Multichannel Keyword Spotting for Noisy Conditions
by: Saladukha, Dzmitry, et al.
Published: (2025)
by: Saladukha, Dzmitry, et al.
Published: (2025)
Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
by: Gundluru, Ramesh, et al.
Published: (2025)
by: Gundluru, Ramesh, et al.
Published: (2025)
Adversarial training of Keyword Spotting to Minimize TTS Data Overfitting
by: Park, Hyun Jin, et al.
Published: (2024)
by: Park, Hyun Jin, et al.
Published: (2024)
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
by: Wang, Zhenyu, et al.
Published: (2024)
by: Wang, Zhenyu, et al.
Published: (2024)
Utilizing TTS Synthesized Data for Efficient Development of Keyword Spotting Model
by: Park, Hyun Jin, et al.
Published: (2024)
by: Park, Hyun Jin, et al.
Published: (2024)
Sparse Binarization for Fast Keyword Spotting
by: Svirsky, Jonathan, et al.
Published: (2024)
by: Svirsky, Jonathan, et al.
Published: (2024)
No Word Left Behind: Mitigating Prefix Bias in Open-Vocabulary Keyword Spotting
by: Liu, Yi, et al.
Published: (2026)
by: Liu, Yi, et al.
Published: (2026)
Noise-Robust Keyword Spotting through Self-supervised Pretraining
by: Mørk, Jacob, et al.
Published: (2024)
by: Mørk, Jacob, et al.
Published: (2024)
LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting
by: Zhu, Pai, et al.
Published: (2025)
by: Zhu, Pai, et al.
Published: (2025)
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer
by: Xi, Yu, et al.
Published: (2024)
by: Xi, Yu, et al.
Published: (2024)
EdgeSpot: Efficient and High-Performance Few-Shot Model for Keyword Spotting
by: Buyuksolak, Oguzhan, et al.
Published: (2026)
by: Buyuksolak, Oguzhan, et al.
Published: (2026)
Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency
by: Xi, Yu, et al.
Published: (2024)
by: Xi, Yu, et al.
Published: (2024)
Efficient Continual Learning in Keyword Spotting using Binary Neural Networks
by: Vu, Quynh Nguyen-Phuong, et al.
Published: (2025)
by: Vu, Quynh Nguyen-Phuong, et al.
Published: (2025)
MT-HuBERT: Self-Supervised Mix-Training for Few-Shot Keyword Spotting in Mixed Speech
by: Yuan, Junming, et al.
Published: (2025)
by: Yuan, Junming, et al.
Published: (2025)
CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting
by: Jin, Sichen, et al.
Published: (2024)
by: Jin, Sichen, et al.
Published: (2024)
Synaspot: A Lightweight, Streaming Multi-modal Framework for Keyword Spotting with Audio-Text Synergy
by: Li, Kewei, et al.
Published: (2025)
by: Li, Kewei, et al.
Published: (2025)
Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech
by: Xi, Yu, et al.
Published: (2024)
by: Xi, Yu, et al.
Published: (2024)
Domain-Incremental Continual Learning for Robust and Efficient Keyword Spotting in Resource Constrained Systems
by: Dhungana, Prakash, et al.
Published: (2026)
by: Dhungana, Prakash, et al.
Published: (2026)
Keyword Spotting with Hyper-Matched Filters for Small Footprint Devices
by: Segal-Feldman, Yael, et al.
Published: (2025)
by: Segal-Feldman, Yael, et al.
Published: (2025)
MFA-KWS: Effective Keyword Spotting with Multi-head Frame-asynchronous Decoding
by: Xi, Yu, et al.
Published: (2025)
by: Xi, Yu, et al.
Published: (2025)
Masked Self-distilled Transducer-based Keyword Spotting with Semi-autoregressive Decoding
by: Xi, Yu, et al.
Published: (2025)
by: Xi, Yu, et al.
Published: (2025)
Frequency & Channel Attention Network for Small Footprint Noisy Spoken Keyword Spotting
by: Lin, Yuanxi, et al.
Published: (2024)
by: Lin, Yuanxi, et al.
Published: (2024)
End-to-End Efficiency in Keyword Spotting: A System-Level Approach for Embedded Microcontrollers
by: Bartoli, Pietro, et al.
Published: (2025)
by: Bartoli, Pietro, et al.
Published: (2025)
Adaptive Noise Resilient Keyword Spotting Using One-Shot Learning
by: Martinez-Rau, Luciano Sebastian, et al.
Published: (2025)
by: Martinez-Rau, Luciano Sebastian, et al.
Published: (2025)
Multi-Sample Dynamic Time Warping for Few-Shot Keyword Spotting
by: Wilkinghoff, Kevin, et al.
Published: (2024)
by: Wilkinghoff, Kevin, et al.
Published: (2024)
OnDA: On-device Channel Pruning for Efficient Personalized Keyword Spotting
by: Risso, Matteo, et al.
Published: (2026)
by: Risso, Matteo, et al.
Published: (2026)
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology
by: Dai, Weinan, et al.
Published: (2024)
by: Dai, Weinan, et al.
Published: (2024)
Similar Items
-
Effective User-defined Keyword Spotting with Dual-stage Matching, Multi-modal Enrollment, and Continual Adaptation
by: Ai, Zhiqi, et al.
Published: (2026) -
MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting
by: Ai, Zhiqi, et al.
Published: (2024) -
Phoneme-Level Contrastive Learning for User-Defined Keyword Spotting with Flexible Enrollment
by: Kewei, Li, et al.
Published: (2024) -
Enhancing Open-Set Speaker Identification through Rapid Tuning with Speaker Reciprocal Points and Negative Sample
by: Chen, Zhiyong, et al.
Published: (2024) -
StyleFusion TTS: Multimodal Style-control and Enhanced Feature Fusion for Zero-shot Text-to-speech Synthesis
by: Chen, Zhiyong, et al.
Published: (2024)