:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	van der Veen, Olaf, Dzebo, Semir, Littvay, Levi, Hawkins, Kirk, Dar, Oren
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2408.15213
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Populism Meets AI: Advancing Populism Research with LLMs
by: Jung, Yujin J., et al.
Published: (2025)

Unpacking Populist Secessionism: Elite Discourse and Mass Attitudes in Republika Srpska, Bosnia and Herzegovina
by: Semir Dzebo
Published: (2025)

Transferable speech-to-text large language model alignment module
by: Wu, Boyong, et al.
Published: (2024)

Natural language guidance of high-fidelity text-to-speech with synthetic annotations
by: Lyth, Dan, et al.
Published: (2024)

Code-switching in text and speech challenges information-theoretic speaker design
by: Bhattacharya, Debasmita, et al.
Published: (2024)

Prominence-aware automatic speech recognition for conversational speech
by: Linke, Julian, et al.
Published: (2025)

A thorough benchmark of automatic text classification: From traditional approaches to large language models
by: Cunha, Washington, et al.
Published: (2025)

Red and blue language: Word choices in the Trump & Harris 2024 presidential debate
by: Wicke, Philipp, et al.
Published: (2024)

Extracting chemical food safety hazards from the scientific literature automatically using large language models
by: Özen, Neris, et al.
Published: (2024)

Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition?
by: Araiza-Illan, Gloria, et al.
Published: (2023)

End-to-end Speech Recognition with similar length speech and text
by: Fan, Peng, et al.
Published: (2025)

A study on the impact of Self-Supervised Learning on automatic dysarthric speech assessment
by: Cadet, Xavier F., et al.
Published: (2023)

Language-agnostic, automated assessment of listeners' speech recall using large language models
by: Herrmann, Björn
Published: (2025)

Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
by: Wright, George August, et al.
Published: (2023)

An efficient text augmentation approach for contextualized Mandarin speech recognition
by: Zheng, Naijun, et al.
Published: (2024)

Spatio-temporal transformer to support automatic sign language translation
by: Ruiz, Christian, et al.
Published: (2025)

What makes a good metric? Evaluating automatic metrics for text-to-image consistency
by: Ross, Candace, et al.
Published: (2024)

Break Out the Silverware -- Semantic Understanding of Stored Household Items
by: Levi-Richter, Michaela, et al.
Published: (2025)

Understanding the effects of language-specific class imbalance in multilingual fine-tuning
by: Jung, Vincent, et al.
Published: (2024)

Synthetically generated text for supervised text analysis
by: Halterman, Andrew
Published: (2023)

Integrating automatic speech recognition into remote healthcare interpreting: A pilot study of its impact on interpreting quality
by: Tan, Shiyi, et al.
Published: (2025)

The evaluation of a code-switched Sepedi-English automatic speech recognition system
by: Phaladi, Amanda, et al.
Published: (2024)

Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet
by: Dhakal, Manish, et al.
Published: (2024)

RadEval: A framework for radiology text evaluation
by: Xu, Justin, et al.
Published: (2025)

PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech
by: Rahman, Hanif
Published: (2026)

Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)

Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
by: Fujita, Kenichi, et al.
Published: (2024)

Large language models struggle with ethnographic text annotation
by: Goodall, Leonardo S., et al.
Published: (2026)

AugSumm: towards generalizable speech summarization using synthetic labels from large language model
by: Jung, Jee-weon, et al.
Published: (2024)

Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
by: Garg, Abhinav, et al.
Published: (2024)

Can large audio language models understand child stuttering speech? speech summarization, and source separation
by: Okocha, Chibuzor, et al.
Published: (2025)

A unified front-end framework for English text-to-speech synthesis
by: Ying, Zelin, et al.
Published: (2023)

Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance
by: Petersen, Molly R., et al.
Published: (2023)

Perspectives on goal setting: Video‐reflexive ethnography with speech–language therapists and clients
by: Laurien Brauner, et al.
Published: (2024)

Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics
by: Kocbek, Primoz, et al.
Published: (2025)

Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)

Impact of automatic speech recognition quality on Alzheimer's disease detection from spontaneous speech: a reproducible benchmark study with lexical modeling and statistical validation
by: Samanta, Himadri S
Published: (2026)

Differentially-private text generation degrades output language quality
by: Çano, Erion, et al.
Published: (2025)

Machine-generated text detection prevents language model collapse
by: Drayson, George, et al.
Published: (2025)

Detecting out-of-distribution text using topological features of transformer-based language models
by: Pollano, Andres, et al.
Published: (2023)