| Main Authors: | Roll, Nathan; Bhalerao, Pranav; Bartelds, Martijn; Pawar, Arjun; Tatsumi, Yuka; Ogunremi, Tolulope; Shani, Chen; Graham, Calbert; Sumner, Meghan; Jurafsky, Dan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.06972 |
Similar Items
In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties
by: Roll, Nathan, et al.
Published: (2025)
PSST! Prosodic Speech Segmentation with Transformers
by: Roll, Nathan, et al.
Published: (2023)
Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition
by: Bartelds, Martijn, et al.
Published: (2025)
False Friends Are Not Foes: Investigating Vocabulary Overlap in Multilingual Language Models
by: Kallini, Julie, et al.
Published: (2025)
The Roots of Performance Disparity in Multilingual Language Models: Intrinsic Modeling Difficulty or Design Choices?
by: Shani, Chen, et al.
Published: (2026)
Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings
by: Attia, Ahmed Adel, et al.
Published: (2024)
CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments
by: Attia, Ahmed Adel, et al.
Published: (2024)
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus
by: Ogunremi, Tolulope, et al.
Published: (2023)
"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most
by: Zhou, Kaitlyn, et al.
Published: (2026)
OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
by: Ngo, Huong, et al.
Published: (2025)
Beyond Tokens: Concept-Level Training Objectives for LLMs
by: Iyer, Laya, et al.
Published: (2026)
Stepback: Enhanced Disentanglement for Voice Conversion via Multi-Task Learning
by: Yang, Qian, et al.
Published: (2025)
Multi-Stage Speaker Diarization for Noisy Classrooms
by: Khan, Ali Sartaz, et al.
Published: (2025)
Learning Concepts, Not Tokens: Self-Supervised Semantic Alignment for Language Models
by: Zhang, Christine, et al.
Published: (2026)
The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models
by: Roll, Nathan, et al.
Published: (2025)
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets
by: Shi, Jiatong, et al.
Published: (2024)
PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation
by: Roll, Nathan
Published: (2025)
Entanglement as Memory: Mechanistic Interpretability of Quantum Language Models
by: Roll, Nathan
Published: (2026)
Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)
Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
by: Jain, Yash, et al.
Published: (2024)
Optimizing Multilingual Text-To-Speech with Accents & Emotions
by: Pawar, Pranav, et al.
Published: (2025)
Voice "Cloning" is Style Transfer
by: Zhou, Kaitlyn, et al.
Published: (2026)
La conquista de los Estados Unidos por España
by: William Graham Sumner
Published: (2005)
Long-Term Health Outcomes of Early Parental Loss: A Case Study of African Adults
by: Blessing Oluwaferanmi Oyelami, et al.
Published: (2025)
Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens
by: Manakul, Potsawee, et al.
Published: (2026)
From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
by: Shani, Chen, et al.
Published: (2025)
Rethinking Word Similarity: Semantic Similarity through Classification Confusion
by: Zhou, Kaitlyn, et al.
Published: (2025)
Metastatic Melanoma to the Urinary Bladder: A Rare Cause of Visible Haematuria
by: Olawale Ogunremi, et al.
Published: (2024)
Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
by: Chen, Chen, et al.
Published: (2024)
The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
by: Chen, William, et al.
Published: (2025)
Lane Change Classification and Prediction with Action Recognition Networks
by: Liang, Kai, et al.
Published: (2022)
Revisiting Categorical Color Perception in Scatterplots: Sequential, Diverging, and Categorical Palettes
by: Tseng, Chin, et al.
Published: (2024)
Automatic Speech Recognition for the Ika Language
by: Nzenwata, Uchenna, et al.
Published: (2024)
The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge
by: Tian, Jingguang, et al.
Published: (2024)
Characterizing simulation relations through control architectures in abstraction-based control
by: Calbert, Julien, et al.
Published: (2024)
Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges
by: Liu, Dancheng, et al.
Published: (2024)
Migración, codesarrollo y capital social. Lineamientos para una estrategia de integración de dos mundos
by: David Roll
Published: (2010)
Poor Man's Fortune
by: Roll, Jarod
Published: (2023)