Saved in:
| Main Authors: | Roux, Thibault Bañeras, Wottawa, Jane, Rouvier, Mickael, Merlin, Teva, Dufour, Richard |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.27542 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Paradigm for Interpreting Metrics and Identifying Critical Errors in Automatic Speech Recognition
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
A Comprehensive Analysis of Tokenization and Self-Supervised Learning in End-to-End Automatic Speech Recognition applied on French Language
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
Evaluation of Automatic Speech Recognition Using Generative Large Language Models
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
A Benchmark of French ASR Systems Based on Error Severity
by: Tholly, Antoine, et al.
Published: (2025)
by: Tholly, Antoine, et al.
Published: (2025)
A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks
by: Labrak, Yanis, et al.
Published: (2023)
by: Labrak, Yanis, et al.
Published: (2023)
An Empirical Analysis of Discrete Unit Representations in Speech Language Modeling Pre-training
by: Labrak, Yanis, et al.
Published: (2025)
by: Labrak, Yanis, et al.
Published: (2025)
Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems
by: Raymondaud, Quentin, et al.
Published: (2024)
by: Raymondaud, Quentin, et al.
Published: (2024)
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
by: Labrak, Yanis, et al.
Published: (2024)
by: Labrak, Yanis, et al.
Published: (2024)
How Important Is Tokenization in French Medical Masked Language Models?
by: Labrak, Yanis, et al.
Published: (2024)
by: Labrak, Yanis, et al.
Published: (2024)
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
by: Labrak, Yanis, et al.
Published: (2024)
by: Labrak, Yanis, et al.
Published: (2024)
Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 2
by: Bousquet, Pierre-Michel, et al.
Published: (2024)
by: Bousquet, Pierre-Michel, et al.
Published: (2024)
MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition
by: Duret, Jarod, et al.
Published: (2024)
by: Duret, Jarod, et al.
Published: (2024)
SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation
by: Du, Jiayu, et al.
Published: (2024)
by: Du, Jiayu, et al.
Published: (2024)
Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation
by: Sperber, Matthias, et al.
Published: (2024)
by: Sperber, Matthias, et al.
Published: (2024)
Identifying Reliable Evaluation Metrics for Scientific Text Revision
by: Jourdan, Léane, et al.
Published: (2025)
by: Jourdan, Léane, et al.
Published: (2025)
Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
by: Bañeras-Roux, Thibault, et al.
Published: (2026)
DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain
by: Labrak, Yanis, et al.
Published: (2024)
by: Labrak, Yanis, et al.
Published: (2024)
Late Fusion and Multi-Level Fission Amplify Cross-Modal Transfer in Text-Speech LMs
by: Cuervo, Santiago, et al.
Published: (2025)
by: Cuervo, Santiago, et al.
Published: (2025)
Responsible Benchmarking of Fairness for Automatic Speech Recognition
by: Herron, Felix, et al.
Published: (2026)
by: Herron, Felix, et al.
Published: (2026)
Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition
by: Rong, Yiming, et al.
Published: (2025)
by: Rong, Yiming, et al.
Published: (2025)
End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs
by: Luu, Nam, et al.
Published: (2025)
by: Luu, Nam, et al.
Published: (2025)
HATS: High-Accuracy Triple-Set Watermarking for Large Language Models
by: Hu, Zhiqing, et al.
Published: (2025)
by: Hu, Zhiqing, et al.
Published: (2025)
Developing an Automatic Pronunciation Scorer: Aligning Speech Evaluation Models and Applied Linguistics Constructs
by: Danwei Cai, et al.
Published: (2025)
by: Danwei Cai, et al.
Published: (2025)
Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language
by: Abu, Turi, et al.
Published: (2025)
by: Abu, Turi, et al.
Published: (2025)
The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction
by: Amalvy, Arthur, et al.
Published: (2024)
by: Amalvy, Arthur, et al.
Published: (2024)
Automatic Speech Recognition for the Ika Language
by: Nzenwata, Uchenna, et al.
Published: (2024)
by: Nzenwata, Uchenna, et al.
Published: (2024)
Categorize Early, Integrate Late: Divergent Processing Strategies in Automatic Speech Recognition
by: Roll, Nathan, et al.
Published: (2026)
by: Roll, Nathan, et al.
Published: (2026)
The Role of Global and Local Context in Named Entity Recognition
by: Amalvy, Arthur, et al.
Published: (2023)
by: Amalvy, Arthur, et al.
Published: (2023)
AutoMetrics: Approximate Human Judgements with Automatically Generated Evaluators
by: Ryan, Michael J., et al.
Published: (2025)
by: Ryan, Michael J., et al.
Published: (2025)
Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)
by: Saha, Anish, et al.
Published: (2024)
Open Automatic Speech Recognition Models for Classical and Modern Standard Arabic
by: Grigoryan, Lilit, et al.
Published: (2025)
by: Grigoryan, Lilit, et al.
Published: (2025)
Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
by: Park, ChaeHun, et al.
Published: (2024)
by: Park, ChaeHun, et al.
Published: (2024)
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities
by: Adila, Aulia, et al.
Published: (2024)
by: Adila, Aulia, et al.
Published: (2024)
OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
by: Sudo, Yui, et al.
Published: (2025)
by: Sudo, Yui, et al.
Published: (2025)
Vietnamese Automatic Speech Recognition: A Revisit
by: Vu, Thi, et al.
Published: (2026)
by: Vu, Thi, et al.
Published: (2026)
Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset
by: Amalvy, Arthur, et al.
Published: (2023)
by: Amalvy, Arthur, et al.
Published: (2023)
A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain
by: Obaidah, Qusai Abo, et al.
Published: (2024)
by: Obaidah, Qusai Abo, et al.
Published: (2024)
Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR
by: Kumar, Shashi, et al.
Published: (2026)
by: Kumar, Shashi, et al.
Published: (2026)
Automatic Speech Recognition for Sanskrit with Transfer Learning
by: Sadhukhan, Bidit, et al.
Published: (2025)
by: Sadhukhan, Bidit, et al.
Published: (2025)
Similar Items
-
A Paradigm for Interpreting Metrics and Identifying Critical Errors in Automatic Speech Recognition
by: Bañeras-Roux, Thibault, et al.
Published: (2026) -
Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
by: Bañeras-Roux, Thibault, et al.
Published: (2026) -
A Comprehensive Analysis of Tokenization and Self-Supervised Learning in End-to-End Automatic Speech Recognition applied on French Language
by: Bañeras-Roux, Thibault, et al.
Published: (2026) -
Evaluation of Automatic Speech Recognition Using Generative Large Language Models
by: Bañeras-Roux, Thibault, et al.
Published: (2026) -
A Benchmark of French ASR Systems Based on Error Severity
by: Tholly, Antoine, et al.
Published: (2025)