:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Herron, Felix, Rossato, Solange, Allauzen, Alexandre, Portet, François
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2604.22631
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Responsible Benchmarking of Fairness for Automatic Speech Recognition
by: Herron, Felix, et al.
Published: (2026)

Where Do Self-Supervised Speech Models Become Unfair?
by: Herron, Felix, et al.
Published: (2026)

LLM-based phoneme-to-grapheme for phoneme-based speech recognition
by: Ma, Te, et al.
Published: (2025)

Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition
by: Dong, Lukuang, et al.
Published: (2026)

Polynomial Mixing for Efficient Self-supervised Speech Encoders
by: Feillet, Eva, et al.
Published: (2026)

Emergent morpho-phonological representations in self-supervised speech models
by: Gauthier, Jon, et al.
Published: (2025)

LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech
by: Parcollet, Titouan, et al.
Published: (2023)

Sustainable self-supervised learning for speech representations
by: Lugo, Luis, et al.
Published: (2024)

emg2speech: Synthesizing speech from electromyography using self-supervised speech models
by: Gowda, Harshavardhana T., et al.
Published: (2025)

Probing self-attention in self-supervised speech models for cross-linguistic differences
by: Gopinath, Sai, et al.
Published: (2024)

Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
by: Garg, Abhinav, et al.
Published: (2024)

PRODIS -- a speech database and a phoneme-based language model for the study of predictability effects in Polish
by: Malisz, Zofia, et al.
Published: (2024)

Word stress in self-supervised speech models: A cross-linguistic comparison
by: Bentum, Martijn, et al.
Published: (2025)

Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?
by: Méloux, Maxime, et al.
Published: (2025)

SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
by: Duong, Song, et al.
Published: (2025)

Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations
by: Mohamed, Mukhtar, et al.
Published: (2024)

Employing self-supervised learning models for cross-linguistic child speech maturity classification
by: Zhang, Theo, et al.
Published: (2025)

Can GPT models Follow Human Summarization Guidelines? A Study for Targeted Communication Goals
by: Zhou, Yongxin, et al.
Published: (2023)

Tracking the emergence of linguistic structure in self-supervised models learning from speech
by: Kloots, Marianne de Heer, et al.
Published: (2026)

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
by: Fujita, Kenichi, et al.
Published: (2024)

AfriHuBERT: A self-supervised speech representation model for African languages
by: Alabi, Jesujoba O., et al.
Published: (2024)

Prominence-aware automatic speech recognition for conversational speech
by: Linke, Julian, et al.
Published: (2025)

Introduction to speech recognition
by: Dauphin, Gabriel
Published: (2024)

Automated Clinical Report Generation for Remote Cognitive Remediation: Comparing Knowledge-Engineered Templates and LLMs in Low-Resource Settings
by: Zhou, Yongxin, et al.
Published: (2026)

PSentScore: Evaluating Sentiment Polarity in Dialogue Summarization
by: Zhou, Yongxin, et al.
Published: (2023)

Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition
by: Triantafyllopoulos, Andreas, et al.
Published: (2024)

Analyzing the relationships between pretraining language, phonetic, tonal, and speaker information in self-supervised speech models
by: Gubian, Michele, et al.
Published: (2025)

Do self-supervised speech and language models extract similar representations as human brain?
by: Chen, Peili, et al.
Published: (2023)

Tutorial: $φ$-Transductions in OpenFst via the Gallic Semiring
by: Cognetta, Marco, et al.
Published: (2025)

A* shortest string decoding for non-idempotent semirings
by: Gorman, Kyle, et al.
Published: (2022)

Improving child speech recognition with augmented child-like speech
by: Zhang, Yuanyuan, et al.
Published: (2024)

Cropping outperforms dropout as an augmentation strategy for self-supervised training of text embeddings
by: González-Márquez, Rita, et al.
Published: (2025)

Exploring Precision and Recall to assess the quality and diversity of LLMs
by: Bronnec, Florian Le, et al.
Published: (2024)

Mechanistic Interpretability as Statistical Estimation: A Variance Analysis
by: Méloux, Maxime, et al.
Published: (2025)

Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
by: Ueda, Kentaro, et al.
Published: (2025)

MedMeta: A Benchmark for LLMs in Synthesizing Meta-Analysis Conclusion from Medical Studies
by: Ha, Huy Hoang, et al.
Published: (2026)

Structured-Sparse Attention for Entity Tracking with Subquadratic Sequence Complexity
by: Zhao, Hangyue, et al.
Published: (2026)

Trading Complexity for Expressivity Through Structured Generalized Linear Token Mixing
by: Fagnou, Erwan, et al.
Published: (2026)

Chain and Causal Attention for Efficient Entity Tracking
by: Fagnou, Erwan, et al.
Published: (2024)

What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search
by: Zhang, Xinhao, et al.
Published: (2026)