:: Library Catalog

Bibliographic Details
Main Authors:	Lin, Yu; Wang, Yiming; Cai, Runyuan; Zeng, Xiaodong
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.28139

Similar Items

Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers
by: Cai, Runyuan, et al.
Published: (2026)

Tiny-Engram: Trigger-Indexed Concept Tables for Generative Vision
by: Cai, Runyuan, et al.
Published: (2026)

ASKD-Whisper: Adaptive Self-knowledge Distillation for Efficient and Low-Latency Automatic Speech Recognition
by: Lee, Junseok, et al.
Published: (2026)

Demonstration of Adapt4Me: An Uncertainty-Aware Authoring Environment for Personalizing Automatic Speech Recognition to Non-normative Speech
by: Pokel, Niclas, et al.
Published: (2026)

RAS: a Reliability Oriented Metric for Automatic Speech Recognition
by: Huang, Wenbin, et al.
Published: (2026)

Flow-OPD: On-Policy Distillation for Flow Matching Models
by: Fang, Zhen, et al.
Published: (2026)

Automatic Speech Recognition for Sanskrit with Transfer Learning
by: Sadhukhan, Bidit, et al.
Published: (2025)

Automatic Speech Recognition for Greek Medical Dictation
by: Georgilas, Vardis, et al.
Published: (2025)

Doing More with Less: Data Augmentation for Sudanese Dialect Automatic Speech Recognition
by: Mansour, Ayman
Published: (2026)

Enabling Automatic Disordered Speech Recognition: An Impaired Speech Dataset in the Akan Language
by: Wiafe, Isaac, et al.
Published: (2026)

Breaking Resource Barriers in Speech Emotion Recognition via Data Distillation
by: Chang, Yi, et al.
Published: (2024)

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
by: Kamahori, Keisuke, et al.
Published: (2025)

Augmenting Automatic Speech Recognition Models with Disfluency Detection
by: Amann, Robin, et al.
Published: (2024)

Efficient Speech Command Recognition Leveraging Spiking Neural Network and Curriculum Learning-based Knowledge Distillation
by: Wang, Jiaqi, et al.
Published: (2024)

Speech Retrieval-Augmented Generation without Automatic Speech Recognition
by: Min, Do June, et al.
Published: (2024)

Handling Numeric Expressions in Automatic Speech Recognition
by: Huber, Christian, et al.
Published: (2024)

Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information
by: Taguchi, Chihiro, et al.
Published: (2024)

ClST: A Convolutional Transformer Framework for Automatic Modulation Recognition by Knowledge Distillation
by: Hou, Dongbin, et al.
Published: (2023)

Lightning OPD: Efficient Post-Training for Large Reasoning Models with Offline On-Policy Distillation
by: Wu, Yecheng, et al.
Published: (2026)

Semantically Corrected Amharic Automatic Speech Recognition
by: Adnew, Samuael, et al.
Published: (2024)

Benchmarking Rotary Position Embeddings for Automatic Speech Recognition
by: Zhang, Shucong, et al.
Published: (2025)

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
by: Jain, Yash, et al.
Published: (2024)

Fairness of Automatic Speech Recognition: Looking Through a Philosophical Lens
by: Choi, Anna Seo Gyeong, et al.
Published: (2025)

Error-preserving Automatic Speech Recognition of Young English Learners' Language
by: Michot, Janick, et al.
Published: (2024)

Automatic Speech Recognition in the Modern Era: Architectures, Training, and Evaluation
by: Nayeem, Md., et al.
Published: (2025)

Speech Recognition-based Feature Extraction for Enhanced Automatic Severity Classification in Dysarthric Speech
by: Choi, Yerin, et al.
Published: (2024)

Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations
by: Shome, Debaditya, et al.
Published: (2023)

AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition
by: Bao, Chen, et al.
Published: (2025)

Asymmetric On-Policy Distillation: Bridging Exploitation and Imitation at the Token Level
by: Jia, Nan, et al.
Published: (2026)

Explanova: Automatically Discover Data Insights in N × M Table via XAI Combined LLM Workflow
by: Huang, Yiming
Published: (2026)

Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan
by: Taguchi, Chihiro, et al.
Published: (2026)

A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain
by: Obaidah, Qusai Abo, et al.
Published: (2024)

SCOPE: Signal-Calibrated On-Policy Distillation Enhancement with Dual-Path Adaptive Weighting
by: Zheng, Binbin, et al.
Published: (2026)

Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring
by: Sudarshan, Ankitha, et al.
Published: (2023)

Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
by: Qin, Ruiyang, et al.
Published: (2024)

Serialized Speech Information Guidance with Overlapped Encoding Separation for Multi-Speaker Automatic Speech Recognition
by: Shi, Hao, et al.
Published: (2024)

MORE: Multi-Objective Adversarial Attacks on Speech Recognition
by: Gao, Xiaoxue, et al.
Published: (2026)

X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs
by: Cao, Di, et al.
Published: (2026)

Reasoning Compression with Mixed-Policy Distillation
by: Yang, Han, et al.
Published: (2026)

Interpreting Pretrained Speech Models for Automatic Speech Assessment of Voice Disorders
by: Lau, Hok-Shing, et al.
Published: (2024)