| Main Authors: | Ron, Yonathan, Gilboa, Shiri, Dubnov, Tammuz |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.18966 |
Similar Items
Gradient Correlation Subspace Learning against Catastrophic Forgetting
by: Dubnov, Tammuz, et al.
Published: (2024)
Fine-tuning Whisper for Pashto ASR: strategies and scale
by: Rahman, Hanif
Published: (2026)
Extending Whisper with prompt tuning to target-speaker ASR
by: Ma, Hao, et al.
Published: (2023)
Binaural sound source localization using a hybrid time and frequency domain model
by: Geva, Gil, et al.
Published: (2024)
A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario
by: Song, Zheshu, et al.
Published: (2024)
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR
by: Song, Zheshu, et al.
Published: (2024)
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper
by: Thorbecke, Iuliia, et al.
Published: (2024)
Quantizing Whisper-small: How design choices affect ASR performance
by: Söhler, Arthur, et al.
Published: (2025)
WhisperKit: On-device Real-time ASR with Billion-Scale Transformers
by: Orhon, Atila, et al.
Published: (2025)
On the Role of Encoder Depth: Pruning Whisper and LoRA Fine-Tuning in SLAM-ASR
by: Kolluri, Ganesh Pavan Kartikeya Bharadwaj, et al.
Published: (2026)
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning
by: Özyilmaz, Ömer Tarik, et al.
Published: (2025)
Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder
by: Li, Zhengyang, et al.
Published: (2026)
Evaluating ASR robustness to spontaneous speech errors: A study of WhisperX using a Speech Error Database
by: Alderete, John, et al.
Published: (2025)
Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper
by: Xu, Tianyi, et al.
Published: (2024)
LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation
by: Li, Shaojun, et al.
Published: (2024)
Data Whisperer: Efficient Data Selection for Task-Specific LLM Fine-Tuning via Few-Shot In-Context Learning
by: Wang, Shaobo, et al.
Published: (2025)
Whispering Context: Distilling Syntax and Semantics for Long Speech Transcripts
by: Altinok, Duygu
Published: (2025)
OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples
by: Koike, Ryuto, et al.
Published: (2023)
Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning
by: Zhang, Yuchen, et al.
Published: (2026)
Improving Domain-Specific ASR with LLM-Generated Contextual Descriptions
by: Suh, Jiwon, et al.
Published: (2024)
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks
by: Everson, Kevin, et al.
Published: (2024)
Enhancing ASR Performance in the Medical Domain for Dravidian Languages
by: Devarakonda, Sri Charan, et al.
Published: (2026)
Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages
by: Anidjar, Or Haim, et al.
Published: (2024)
Zero-Shot Context-Aware ASR for Diverse Arabic Varieties
by: Talafha, Bashar, et al.
Published: (2025)
Context-Enhanced Granular Edit Representation for Efficient and Accurate ASR Post-editing
by: Vejsiu, Luan, et al.
Published: (2025)
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR
by: Peng, Yizhou, et al.
Published: (2025)
How Robust Are Large Language Models for Clinical Numeracy? An Empirical Study on Numerical Reasoning Abilities in Clinical Contexts
by: Nguyen, Minh-Vuong, et al.
Published: (2026)
Whispering in Amharic: Fine-tuning Whisper for Low-resource Language
by: Gete, Dawit Ketema, et al.
Published: (2025)
DrugRAG: Enhancing Pharmacy LLM Performance Through A Novel Retrieval-Augmented Generation Pipeline
by: Kazemzadeh, Houman, et al.
Published: (2025)
Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down
by: Wang, Yingzhi, et al.
Published: (2025)
Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR
by: Kumar, Shashi, et al.
Published: (2026)
When Helpful Context Leaks: Privacy Risks in Domain-Adapted ASR
by: Züfle, Maike, et al.
Published: (2026)
Semi-Autoregressive Streaming ASR With Label Context
by: Arora, Siddhant, et al.
Published: (2023)
Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages
by: de Zuazo, Xabier, et al.
Published: (2025)
Enhancing Context Through Contrast
by: Ambilduke, Kshitij, et al.
Published: (2024)
Exploring the Potential of Multimodal LLM with Knowledge-Intensive Multimodal ASR
by: Wang, Minghan, et al.
Published: (2024)
Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla
by: Ridoy, Md Sazzadul Islam, et al.
Published: (2025)
Enhancing Long Context Performance in LLMs Through Inner Loop Query Mechanism
by: Tang, Yimin, et al.
Published: (2024)
MEDSAGE: Enhancing Robustness of Medical Dialogue Summarization to ASR Errors with LLM-generated Synthetic Dialogues
by: Binici, Kuluhan, et al.
Published: (2024)
Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
by: Wang, Haoyu, et al.
Published: (2024)