| Main Authors: | Li, Yuang, Zhang, Min, Su, Chang, Li, Yinglu, Qiao, Xiaosong, Ren, Mengxin, Ma, Miaomiao, Wei, Daimeng, Tao, Shimin, Yang, Hao |
|---|---|
| Format: | Preprint |
| Published: | 2023 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2309.09552 |
Similar Items
Cross-Domain Audio Deepfake Detection: Dataset and Analysis
by: Li, Yuang, et al.
Published: (2024)
UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction
by: Guo, Jiaxin, et al.
Published: (2024)
Using Large Language Model for End-to-End Chinese ASR and NER
by: Li, Yuang, et al.
Published: (2024)
OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
by: Sudo, Yui, et al.
Published: (2025)
MATE: Matryoshka Audio-Text Embeddings for Open-Vocabulary Keyword Spotting
by: Jung, Youngmoon, et al.
Published: (2026)
WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing
by: Nakagome, Yu, et al.
Published: (2025)
No Word Left Behind: Mitigating Prefix Bias in Open-Vocabulary Keyword Spotting
by: Liu, Yi, et al.
Published: (2026)
Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting
by: Jung, Youngmoon, et al.
Published: (2025)
PCOV-KWS: Multi-task Learning for Personalized Customizable Open Vocabulary Keyword Spotting
by: Pan, Jianan, et al.
Published: (2026)
Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement
by: Dong, Yichen, et al.
Published: (2025)
Improving LLM-based Document-level Machine Translation with Multi-Knowledge Fusion
by: Liu, Bin, et al.
Published: (2025)
DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators
by: Lyu, Xinglin, et al.
Published: (2024)
Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model
by: Lall, Vishakha, et al.
Published: (2024)
Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function
by: Kwok, Chin Yuen, et al.
Published: (2025)
Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
by: Wang, Zhenyu, et al.
Published: (2024)
NTC-KWS: Noise-aware CTC for Robust Keyword Spotting
by: Xi, Yu, et al.
Published: (2024)
Large Language Model Should Understand Pinyin for Chinese ASR Error Correction
by: Li, Yuang, et al.
Published: (2024)
Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency
by: Xi, Yu, et al.
Published: (2024)
Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology
by: Dai, Weinan, et al.
Published: (2024)
Text-aware Speech Separation for Multi-talker Keyword Spotting
by: Li, Haoyu, et al.
Published: (2024)
Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation
by: Li, Ying, et al.
Published: (2026)
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM
by: Yu, Jiawei, et al.
Published: (2024)
Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech
by: Xi, Yu, et al.
Published: (2024)
TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer
by: Xi, Yu, et al.
Published: (2024)
Keyword Mamba: Spoken Keyword Spotting with State Space Models
by: Ding, Hanyu, et al.
Published: (2025)
Contextual Biasing for Streaming ASR via CTC-based Word Spotting
by: Tsai, Kai-Chen, et al.
Published: (2026)
Few-Shot Keyword Spotting from Mixed Speech
by: Yuan, Junming, et al.
Published: (2024)
"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities
by: Yu, Jiawei, et al.
Published: (2024)
Loong: A Human-Like Long Document Translation Agent with Observe-and-Act Adaptive Context Selection
by: Wang, Yutong, et al.
Published: (2026)
Effective Integration of KAN for Keyword Spotting
by: Xu, Anfeng, et al.
Published: (2024)
Perforated Neural Networks for Keyword Spotting
by: Gopal, Vishy, et al.
Published: (2026)
Multichannel Keyword Spotting for Noisy Conditions
by: Saladukha, Dzmitry, et al.
Published: (2025)
Sparse Binarization for Fast Keyword Spotting
by: Svirsky, Jonathan, et al.
Published: (2024)
Dark Experience for Incremental Keyword Spotting
by: Peng, Tianyi, et al.
Published: (2024)
CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting
by: Jin, Sichen, et al.
Published: (2024)
Mass-Spring Models for Passive Keyword Spotting: A Springtronics Approach
by: Bohte, Finn, et al.
Published: (2025)
Relational Proxy Loss for Audio-Text based Keyword Spotting
by: Jung, Youngmoon, et al.
Published: (2024)
Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems
by: Ren, Bo, et al.
Published: (2025)
MT-HuBERT: Self-Supervised Mix-Training for Few-Shot Keyword Spotting in Mixed Speech
by: Yuan, Junming, et al.
Published: (2025)
Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
by: Ryu, Myeonghoon, et al.
Published: (2025)