:: Library Catalog

Bibliographic Details
Main Authors:	Roll, Nathan; Bhalerao, Pranav; Bartelds, Martijn; Pawar, Arjun; Tatsumi, Yuka; Ogunremi, Tolulope; Shani, Chen; Graham, Calbert; Sumner, Meghan; Jurafsky, Dan
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2601.06972

Similar Items

In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties
by: Roll, Nathan, et al.
Published: (2025)

PSST! Prosodic Speech Segmentation with Transformers
by: Roll, Nathan, et al.
Published: (2023)

Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)

CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition
by: Bartelds, Martijn, et al.
Published: (2025)

False Friends Are Not Foes: Investigating Vocabulary Overlap in Multilingual Language Models
by: Kallini, Julie, et al.
Published: (2025)

The Roots of Performance Disparity in Multilingual Language Models: Intrinsic Modeling Difficulty or Design Choices?
by: Shani, Chen, et al.
Published: (2026)

Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings
by: Attia, Ahmed Adel, et al.
Published: (2024)

CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments
by: Attia, Ahmed Adel, et al.
Published: (2024)

ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus
by: Ogunremi, Tolulope, et al.
Published: (2023)

"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most
by: Zhou, Kaitlyn, et al.
Published: (2026)

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
by: Ngo, Huong, et al.
Published: (2025)

Beyond Tokens: Concept-Level Training Objectives for LLMs
by: Iyer, Laya, et al.
Published: (2026)

Stepback: Enhanced Disentanglement for Voice Conversion via Multi-Task Learning
by: Yang, Qian, et al.
Published: (2025)

Multi-Stage Speaker Diarization for Noisy Classrooms
by: Khan, Ali Sartaz, et al.
Published: (2025)

Learning Concepts, Not Tokens: Self-Supervised Semantic Alignment for Language Models
by: Zhang, Christine, et al.
Published: (2026)

The Text Aphasia Battery (TAB): A Clinically-Grounded Benchmark for Aphasia-Like Deficits in Language Models
by: Roll, Nathan, et al.
Published: (2025)

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets
by: Shi, Jiatong, et al.
Published: (2024)

PolyPrompt: Automating Knowledge Extraction from Multilingual Language Models with Dynamic Prompt Generation
by: Roll, Nathan
Published: (2025)

Entanglement as Memory: Mechanistic Interpretability of Quantum Language Models
by: Roll, Nathan
Published: (2026)

Cooking Up Creativity: Enhancing LLM Creativity through Structured Recombination
by: Mizrahi, Moran, et al.
Published: (2025)

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
by: Jain, Yash, et al.
Published: (2024)

Optimizing Multilingual Text-To-Speech with Accents & Emotions
by: Pawar, Pranav, et al.
Published: (2025)

Voice "Cloning" is Style Transfer
by: Zhou, Kaitlyn, et al.
Published: (2026)

La conquista de los Estados Unidos por España
by: William Graham Sumner
Published: (2005)

Long-Term Health Outcomes of Early Parental Loss: A Case Study of African Adults
by: Blessing Oluwaferanmi Oyelami, et al.
Published: (2025)

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens
by: Manakul, Potsawee, et al.
Published: (2026)

From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
by: Shani, Chen, et al.
Published: (2025)

Rethinking Word Similarity: Semantic Similarity through Classification Confusion
by: Zhou, Kaitlyn, et al.
Published: (2025)

Metastatic Melanoma to the Urinary Bladder: A Rare Cause of Visible Haematuria
by: Olawale Ogunremi, et al.
Published: (2024)

Automatic Speech Recognition for Hindi
by: Saha, Anish, et al.
Published: (2024)

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition
by: Chen, Chen, et al.
Published: (2024)

The ML-SUPERB 2.0 Challenge: Towards Inclusive ASR Benchmarking for All Language Varieties
by: Chen, William, et al.
Published: (2025)

Lane Change Classification and Prediction with Action Recognition Networks
by: Liang, Kai, et al.
Published: (2022)

Revisiting Categorical Color Perception in Scatterplots: Sequential, Diverging, and Categorical Palettes
by: Tseng, Chin, et al.
Published: (2024)

Automatic Speech Recognition for the Ika Language
by: Nzenwata, Uchenna, et al.
Published: (2024)

The RoyalFlush Automatic Speech Diarization and Recognition System for In-Car Multi-Channel Automatic Speech Recognition Challenge
by: Tian, Jingguang, et al.
Published: (2024)

Characterizing simulation relations through control architectures in abstraction-based control
by: Calbert, Julien, et al.
Published: (2024)

Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges
by: Liu, Dancheng, et al.
Published: (2024)

Migración, codesarrollo y capital social. Lineamientos para una estrategia de integración de dos mundos
by: David Roll
Published: (2010)

Poor Man's Fortune
by: Roll, Jarod
Published: (2023)