:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Yuang, Zhang, Min, Su, Chang, Li, Yinglu, Qiao, Xiaosong, Ren, Mengxin, Ma, Miaomiao, Wei, Daimeng, Tao, Shimin, Yang, Hao
Format:	Preprint
Published:	2023
Subjects:	Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2309.09552
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Cross-Domain Audio Deepfake Detection: Dataset and Analysis
by: Li, Yuang, et al.
Published: (2024)

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction
by: Guo, Jiaxin, et al.
Published: (2024)

Using Large Language Model for End-to-End Chinese ASR and NER
by: Li, Yuang, et al.
Published: (2024)

OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
by: Sudo, Yui, et al.
Published: (2025)

MATE: Matryoshka Audio-Text Embeddings for Open-Vocabulary Keyword Spotting
by: Jung, Youngmoon, et al.
Published: (2026)

WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing
by: Nakagome, Yu, et al.
Published: (2025)

No Word Left Behind: Mitigating Prefix Bias in Open-Vocabulary Keyword Spotting
by: Liu, Yi, et al.
Published: (2026)

Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting
by: Jung, Youngmoon, et al.
Published: (2025)

PCOV-KWS: Multi-task Learning for Personalized Customizable Open Vocabulary Keyword Spotting
by: Pan, Jianan, et al.
Published: (2026)

Two Intermediate Translations Are Better Than One: Fine-tuning LLMs for Document-level Translation Refinement
by: Dong, Yichen, et al.
Published: (2025)

Improving LLM-based Document-level Machine Translation with Multi-Knowledge Fusion
by: Liu, Bin, et al.
Published: (2025)

DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators
by: Lyu, Xinglin, et al.
Published: (2024)

Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model
by: Lall, Vishakha, et al.
Published: (2024)

Improving Synthetic Data Training for Contextual Biasing Models with a Keyword-Aware Cost Function
by: Kwok, Chin Yuen, et al.
Published: (2025)

Disentangled Training with Adversarial Examples For Robust Small-footprint Keyword Spotting
by: Wang, Zhenyu, et al.
Published: (2024)

NTC-KWS: Noise-aware CTC for Robust Keyword Spotting
by: Xi, Yu, et al.
Published: (2024)

Large Language Model Should Understand Pinyin for Chinese ASR Error Correction
by: Li, Yuang, et al.
Published: (2024)

Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency
by: Xi, Yu, et al.
Published: (2024)

Contrastive Augmentation: An Unsupervised Learning Approach for Keyword Spotting in Speech Technology
by: Dai, Weinan, et al.
Published: (2024)

Text-aware Speech Separation for Multi-talker Keyword Spotting
by: Li, Haoyu, et al.
Published: (2024)

Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation
by: Li, Ying, et al.
Published: (2026)

Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM
by: Yu, Jiawei, et al.
Published: (2024)

Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech
by: Xi, Yu, et al.
Published: (2024)

TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer
by: Xi, Yu, et al.
Published: (2024)

Keyword Mamba: Spoken Keyword Spotting with State Space Models
by: Ding, Hanyu, et al.
Published: (2025)

Contextual Biasing for Streaming ASR via CTC-based Word Spotting
by: Tsai, Kai-Chen, et al.
Published: (2026)

Few-Shot Keyword Spotting from Mixed Speech
by: Yuan, Junming, et al.
Published: (2024)

"I've Heard of You!": Generate Spoken Named Entity Recognition Data for Unseen Entities
by: Yu, Jiawei, et al.
Published: (2024)

Loong: A Human-Like Long Document Translation Agent with Observe-and-Act Adaptive Context Selection
by: Wang, Yutong, et al.
Published: (2026)

Effective Integration of KAN for Keyword Spotting
by: Xu, Anfeng, et al.
Published: (2024)

Perforated Neural Networks for Keyword Spotting
by: Gopal, Vishy, et al.
Published: (2026)

Multichannel Keyword Spotting for Noisy Conditions
by: Saladukha, Dzmitry, et al.
Published: (2025)

Sparse Binarization for Fast Keyword Spotting
by: Svirsky, Jonathan, et al.
Published: (2024)

Dark Experience for Incremental Keyword Spotting
by: Peng, Tianyi, et al.
Published: (2024)

CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting
by: Jin, Sichen, et al.
Published: (2024)

Mass-Spring Models for Passive Keyword Spotting: A Springtronics Approach
by: Bohte, Finn, et al.
Published: (2025)

Relational Proxy Loss for Audio-Text based Keyword Spotting
by: Jung, Youngmoon, et al.
Published: (2024)

Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems
by: Ren, Bo, et al.
Published: (2025)

MT-HuBERT: Self-Supervised Mix-Training for Few-Shot Keyword Spotting in Mixed Speech
by: Yuan, Junming, et al.
Published: (2025)

Noise-Agnostic Multitask Whisper Training for Reducing False Alarm Errors in Call-for-Help Detection
by: Ryu, Myeonghoon, et al.
Published: (2025)