:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Roux, Thibault Bañeras, Wottawa, Jane, Rouvier, Mickael, Merlin, Teva, Dufour, Richard
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2604.27542
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Paradigm for Interpreting Metrics and Identifying Critical Errors in Automatic Speech Recognition
by: Bañeras-Roux, Thibault, et al.
Published: (2026)

Qualitative Evaluation of Language Model Rescoring in Automatic Speech Recognition
by: Bañeras-Roux, Thibault, et al.
Published: (2026)

A Comprehensive Analysis of Tokenization and Self-Supervised Learning in End-to-End Automatic Speech Recognition applied on French Language
by: Bañeras-Roux, Thibault, et al.
Published: (2026)

Evaluation of Automatic Speech Recognition Using Generative Large Language Models
by: Bañeras-Roux, Thibault, et al.
Published: (2026)

A Benchmark of French ASR Systems Based on Error Severity
by: Tholly, Antoine, et al.
Published: (2025)

A Zero-shot and Few-shot Study of Instruction-Finetuned Large Language Models Applied to Clinical and Biomedical Tasks
by: Labrak, Yanis, et al.
Published: (2023)

An Empirical Analysis of Discrete Unit Representations in Speech Language Modeling Pre-training
by: Labrak, Yanis, et al.
Published: (2025)

Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems
by: Raymondaud, Quentin, et al.
Published: (2024)

Zero-Shot End-To-End Spoken Question Answering In Medical Domain
by: Labrak, Yanis, et al.
Published: (2024)

How Important Is Tokenization in French Medical Masked Language Models?
by: Labrak, Yanis, et al.
Published: (2024)

BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
by: Labrak, Yanis, et al.
Published: (2024)

Asymmetric and trial-dependent modeling: the contribution of LIA to SdSV Challenge Task 2
by: Bousquet, Pierre-Michel, et al.
Published: (2024)

MSP-Podcast SER Challenge 2024: L'antenne du Ventoux Multimodal Self-Supervised Learning for Speech Emotion Recognition
by: Duret, Jarod, et al.
Published: (2024)

SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation
by: Du, Jiayu, et al.
Published: (2024)

Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation
by: Sperber, Matthias, et al.
Published: (2024)

Identifying Reliable Evaluation Metrics for Scientific Text Revision
by: Jourdan, Léane, et al.
Published: (2025)

Closing the Speech-Text Gap with Limited Audio for Effective Domain Adaptation in LLM-Based ASR
by: Bañeras-Roux, Thibault, et al.
Published: (2026)

DrBenchmark: A Large Language Understanding Evaluation Benchmark for French Biomedical Domain
by: Labrak, Yanis, et al.
Published: (2024)

Late Fusion and Multi-Level Fission Amplify Cross-Modal Transfer in Text-Speech LMs
by: Cuervo, Santiago, et al.
Published: (2025)

Responsible Benchmarking of Fairness for Automatic Speech Recognition
by: Herron, Felix, et al.
Published: (2026)

Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition
by: Rong, Yiming, et al.
Published: (2025)

End-to-end Automatic Speech Recognition and Speech Translation: Integration of Speech Foundational Models and LLMs
by: Luu, Nam, et al.
Published: (2025)

HATS: High-Accuracy Triple-Set Watermarking for Large Language Models
by: Hu, Zhiqing, et al.
Published: (2025)

Developing an Automatic Pronunciation Scorer: Aligning Speech Evaluation Models and Applied Linguistics Constructs
by: Danwei Cai, et al.
Published: (2025)

Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language
by: Abu, Turi, et al.
Published: (2025)

The Role of Natural Language Processing Tasks in Automatic Literary Character Network Construction
by: Amalvy, Arthur, et al.
Published: (2024)

Automatic Speech Recognition for the Ika Language
by: Nzenwata, Uchenna, et al.
Published: (2024)

Categorize Early, Integrate Late: Divergent Processing Strategies in Automatic Speech Recognition
by: Roll, Nathan, et al.
Published: (2026)

The Role of Global and Local Context in Named Entity Recognition
by: Amalvy, Arthur, et al.
Published: (2023)

AutoMetrics: Approximate Human Judgements with Automatically Generated Evaluators
by: Ryan, Michael J., et al.
Published: (2025)

Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)

Open Automatic Speech Recognition Models for Classical and Modern Standard Arabic
by: Grigoryan, Lilit, et al.
Published: (2025)

Evaluating Automatic Speech Recognition Systems for Korean Meteorological Experts
by: Park, ChaeHun, et al.
Published: (2024)

Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities
by: Adila, Aulia, et al.
Published: (2024)

OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary
by: Sudo, Yui, et al.
Published: (2025)

Vietnamese Automatic Speech Recognition: A Revisit
by: Vu, Thi, et al.
Published: (2026)

Learning to Rank Context for Named Entity Recognition Using a Synthetic Dataset
by: Amalvy, Arthur, et al.
Published: (2023)

A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain
by: Obaidah, Qusai Abo, et al.
Published: (2024)

Distilling Conversations: Abstract Compression of Conversational Audio Context for LLM-based ASR
by: Kumar, Shashi, et al.
Published: (2026)

Automatic Speech Recognition for Sanskrit with Transfer Learning
by: Sadhukhan, Bidit, et al.
Published: (2025)