Saved in:
| Main Authors: | Herron, Felix, Rossato, Solange, Allauzen, Alexandre, Portet, François |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.22631 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Responsible Benchmarking of Fairness for Automatic Speech Recognition
by: Herron, Felix, et al.
Published: (2026)
by: Herron, Felix, et al.
Published: (2026)
Where Do Self-Supervised Speech Models Become Unfair?
by: Herron, Felix, et al.
Published: (2026)
by: Herron, Felix, et al.
Published: (2026)
LLM-based phoneme-to-grapheme for phoneme-based speech recognition
by: Ma, Te, et al.
Published: (2025)
by: Ma, Te, et al.
Published: (2025)
Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition
by: Dong, Lukuang, et al.
Published: (2026)
by: Dong, Lukuang, et al.
Published: (2026)
Polynomial Mixing for Efficient Self-supervised Speech Encoders
by: Feillet, Eva, et al.
Published: (2026)
by: Feillet, Eva, et al.
Published: (2026)
Emergent morpho-phonological representations in self-supervised speech models
by: Gauthier, Jon, et al.
Published: (2025)
by: Gauthier, Jon, et al.
Published: (2025)
LeBenchmark 2.0: a Standardized, Replicable and Enhanced Framework for Self-supervised Representations of French Speech
by: Parcollet, Titouan, et al.
Published: (2023)
by: Parcollet, Titouan, et al.
Published: (2023)
Sustainable self-supervised learning for speech representations
by: Lugo, Luis, et al.
Published: (2024)
by: Lugo, Luis, et al.
Published: (2024)
emg2speech: Synthesizing speech from electromyography using self-supervised speech models
by: Gowda, Harshavardhana T., et al.
Published: (2025)
by: Gowda, Harshavardhana T., et al.
Published: (2025)
Probing self-attention in self-supervised speech models for cross-linguistic differences
by: Gopinath, Sai, et al.
Published: (2024)
by: Gopinath, Sai, et al.
Published: (2024)
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
by: Garg, Abhinav, et al.
Published: (2024)
by: Garg, Abhinav, et al.
Published: (2024)
PRODIS -- a speech database and a phoneme-based language model for the study of predictability effects in Polish
by: Malisz, Zofia, et al.
Published: (2024)
by: Malisz, Zofia, et al.
Published: (2024)
Word stress in self-supervised speech models: A cross-linguistic comparison
by: Bentum, Martijn, et al.
Published: (2025)
by: Bentum, Martijn, et al.
Published: (2025)
Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?
by: Méloux, Maxime, et al.
Published: (2025)
by: Méloux, Maxime, et al.
Published: (2025)
SCOPE: A Self-supervised Framework for Improving Faithfulness in Conditional Text Generation
by: Duong, Song, et al.
Published: (2025)
by: Duong, Song, et al.
Published: (2025)
Orthogonality and isotropy of speaker and phonetic information in self-supervised speech representations
by: Mohamed, Mukhtar, et al.
Published: (2024)
by: Mohamed, Mukhtar, et al.
Published: (2024)
Employing self-supervised learning models for cross-linguistic child speech maturity classification
by: Zhang, Theo, et al.
Published: (2025)
by: Zhang, Theo, et al.
Published: (2025)
Can GPT models Follow Human Summarization Guidelines? A Study for Targeted Communication Goals
by: Zhou, Yongxin, et al.
Published: (2023)
by: Zhou, Yongxin, et al.
Published: (2023)
Tracking the emergence of linguistic structure in self-supervised models learning from speech
by: Kloots, Marianne de Heer, et al.
Published: (2026)
by: Kloots, Marianne de Heer, et al.
Published: (2026)
Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
by: Fujita, Kenichi, et al.
Published: (2024)
by: Fujita, Kenichi, et al.
Published: (2024)
AfriHuBERT: A self-supervised speech representation model for African languages
by: Alabi, Jesujoba O., et al.
Published: (2024)
by: Alabi, Jesujoba O., et al.
Published: (2024)
Prominence-aware automatic speech recognition for conversational speech
by: Linke, Julian, et al.
Published: (2025)
by: Linke, Julian, et al.
Published: (2025)
Introduction to speech recognition
by: Dauphin, Gabriel
Published: (2024)
by: Dauphin, Gabriel
Published: (2024)
Automated Clinical Report Generation for Remote Cognitive Remediation: Comparing Knowledge-Engineered Templates and LLMs in Low-Resource Settings
by: Zhou, Yongxin, et al.
Published: (2026)
by: Zhou, Yongxin, et al.
Published: (2026)
PSentScore: Evaluating Sentiment Polarity in Dialogue Summarization
by: Zhou, Yongxin, et al.
Published: (2023)
by: Zhou, Yongxin, et al.
Published: (2023)
Enrolment-based personalisation for improving individual-level fairness in speech emotion recognition
by: Triantafyllopoulos, Andreas, et al.
Published: (2024)
by: Triantafyllopoulos, Andreas, et al.
Published: (2024)
Analyzing the relationships between pretraining language, phonetic, tonal, and speaker information in self-supervised speech models
by: Gubian, Michele, et al.
Published: (2025)
by: Gubian, Michele, et al.
Published: (2025)
Do self-supervised speech and language models extract similar representations as human brain?
by: Chen, Peili, et al.
Published: (2023)
by: Chen, Peili, et al.
Published: (2023)
Tutorial: $φ$-Transductions in OpenFst via the Gallic Semiring
by: Cognetta, Marco, et al.
Published: (2025)
by: Cognetta, Marco, et al.
Published: (2025)
A* shortest string decoding for non-idempotent semirings
by: Gorman, Kyle, et al.
Published: (2022)
by: Gorman, Kyle, et al.
Published: (2022)
Improving child speech recognition with augmented child-like speech
by: Zhang, Yuanyuan, et al.
Published: (2024)
by: Zhang, Yuanyuan, et al.
Published: (2024)
Cropping outperforms dropout as an augmentation strategy for self-supervised training of text embeddings
by: González-Márquez, Rita, et al.
Published: (2025)
by: González-Márquez, Rita, et al.
Published: (2025)
Exploring Precision and Recall to assess the quality and diversity of LLMs
by: Bronnec, Florian Le, et al.
Published: (2024)
by: Bronnec, Florian Le, et al.
Published: (2024)
Mechanistic Interpretability as Statistical Estimation: A Variance Analysis
by: Méloux, Maxime, et al.
Published: (2025)
by: Méloux, Maxime, et al.
Published: (2025)
Merging Continual Pretraining Models for Domain-Specialized LLMs: A Case Study in Finance
by: Ueda, Kentaro, et al.
Published: (2025)
by: Ueda, Kentaro, et al.
Published: (2025)
MedMeta: A Benchmark for LLMs in Synthesizing Meta-Analysis Conclusion from Medical Studies
by: Ha, Huy Hoang, et al.
Published: (2026)
by: Ha, Huy Hoang, et al.
Published: (2026)
Structured-Sparse Attention for Entity Tracking with Subquadratic Sequence Complexity
by: Zhao, Hangyue, et al.
Published: (2026)
by: Zhao, Hangyue, et al.
Published: (2026)
Trading Complexity for Expressivity Through Structured Generalized Linear Token Mixing
by: Fagnou, Erwan, et al.
Published: (2026)
by: Fagnou, Erwan, et al.
Published: (2026)
Chain and Causal Attention for Efficient Entity Tracking
by: Fagnou, Erwan, et al.
Published: (2024)
by: Fagnou, Erwan, et al.
Published: (2024)
What Makes an LLM a Good Optimizer? A Trajectory Analysis of LLM-Guided Evolutionary Search
by: Zhang, Xinhao, et al.
Published: (2026)
by: Zhang, Xinhao, et al.
Published: (2026)
Similar Items
-
Responsible Benchmarking of Fairness for Automatic Speech Recognition
by: Herron, Felix, et al.
Published: (2026) -
Where Do Self-Supervised Speech Models Become Unfair?
by: Herron, Felix, et al.
Published: (2026) -
LLM-based phoneme-to-grapheme for phoneme-based speech recognition
by: Ma, Te, et al.
Published: (2025) -
Advancing LLM-based phoneme-to-grapheme for multilingual speech recognition
by: Dong, Lukuang, et al.
Published: (2026) -
Polynomial Mixing for Efficient Self-supervised Speech Encoders
by: Feillet, Eva, et al.
Published: (2026)