:: Library Catalog

Bibliographic Details
Main Authors:	Fu, Li; Yu, Shanyong; Li, Siqi; Fan, Lu; Wu, Youzheng; He, Xiaodong
Format:	Preprint
Published:	2024
Subjects:	Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2412.17507

Similar Items

PAC: Pronunciation-Aware Contextualized Large Language Model-based Automatic Speech Recognition
by: Fu, Li, et al.
Published: (2025)

MOPSA: Mixture of Prompt-Experts Based Speaker Adaptation for Elderly Speech Recognition
by: Deng, Chengxi, et al.
Published: (2025)

CAMEL: Cross-Attention Enhanced Mixture-of-Experts and Language Bias for Code-Switching Speech Recognition
by: Wang, He, et al.
Published: (2024)

FNH-TTS: Mixture-of-Experts Duration Modeling for Robust Neural Speech Synthesis
by: Meng, Qingliang, et al.
Published: (2025)

Enhancing Code-Switching Speech Recognition with LID-Based Collaborative Mixture of Experts Model
by: Huang, Hukai, et al.
Published: (2024)

Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech
by: Bhattacharjee, Susmita, et al.
Published: (2025)

Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models
by: Lin, Yuke, et al.
Published: (2025)

Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
by: Park, ChaeHun, et al.
Published: (2024)

FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition
by: Kim, Jongsuk, et al.
Published: (2025)

Speaker Attributed Automatic Speech Recognition Using Speech Aware LLMS
by: Aronowitz, Hagai, et al.
Published: (2026)

Robust Audiovisual Speech Recognition Models with Mixture-of-Experts
by: Wu, Yihan, et al.
Published: (2024)

Unsupervised Online Continual Learning for Automatic Speech Recognition
by: Eeckt, Steven Vander, et al.
Published: (2024)

Using Songs to Improve Kazakh Automatic Speech Recognition
by: Yeshpanov, Rustem
Published: (2026)

The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge
by: Tian, Jingguang, et al.
Published: (2024)

The DKU System for Multi-Speaker Automatic Speech Recognition in MLC-SLM Challenge
by: Lin, Yuke, et al.
Published: (2025)

Unifying Speech Recognition, Synthesis and Conversion with Autoregressive Transformers
by: Cai, Runyuan, et al.
Published: (2026)

Non-Intrusive Automatic Speech Recognition Refinement: A Survey
by: Peyghan, Mohammad Reza, et al.
Published: (2025)

Joint Learning using Mixture-of-Expert-Based Representation for Speech Enhancement and Robust Emotion Recognition
by: Tzeng, Jing-Tong, et al.
Published: (2025)

SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation
by: Du, Jiayu, et al.
Published: (2024)

Mixture of LoRA Experts with Multi-Modal and Multi-Granularity LLM Generative Error Correction for Accented Speech Recognition
by: Mu, Bingshen, et al.
Published: (2025)

Dynamic Data Pruning for Automatic Speech Recognition
by: Xiao, Qiao, et al.
Published: (2024)

Too Good to Be True: A Study on Modern Automatic Speech Recognition for the Evaluation of Speech Enhancement
by: de Oliveira, Danilo, et al.
Published: (2026)

Group-Aware Partial Model Merging for Children's Automatic Speech Recognition
by: Rolland, Thomas, et al.
Published: (2025)

Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding
by: Hu, Jiliang, et al.
Published: (2025)

Multi-Scale Temporal Transformer For Speech Emotion Recognition
by: Li, Zhipeng, et al.
Published: (2024)

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition
by: Dai, Yuhang, et al.
Published: (2025)

Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages
by: Meng, Yangyang, et al.
Published: (2025)

Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation
by: Wang, Pu, et al.
Published: (2024)

Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models
by: Polok, Alexander, et al.
Published: (2024)

Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition
by: Eeckt, Steven Vander, et al.
Published: (2022)

Adaptive Mixture of Low-Rank Experts for Robust Audio Spoofing Detection
by: Chen, Qixian, et al.
Published: (2025)

Enhancing Automatic Chord Recognition through LLM Chain-of-Thought Reasoning
by: Chang, Chih-Cheng, et al.
Published: (2025)

SpecASR: Accelerating LLM-based Automatic Speech Recognition via Speculative Decoding
by: Wei, Linye, et al.
Published: (2025)

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction
by: Guo, Jiaxin, et al.
Published: (2024)

Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)

Zero-Shot Recognition of Dysarthric Speech Using Commercial Automatic Speech Recognition and Multimodal Large Language Models
by: Alsayegh, Ali, et al.
Published: (2025)

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
by: Xue, Hongfei, et al.
Published: (2024)

Augmenting Polish Automatic Speech Recognition System With Synthetic Data
by: Bondaruk, Łukasz, et al.
Published: (2024)

Leveraging Self-Supervised Models for Automatic Whispered Speech Recognition
by: Farhadipour, Aref, et al.
Published: (2024)

Rehearsal-Free Online Continual Learning for Automatic Speech Recognition
by: Eeckt, Steven Vander, et al.
Published: (2023)