Saved in:
| Main Authors: | van der Veen, Olaf, Dzebo, Semir, Littvay, Levi, Hawkins, Kirk, Dar, Oren |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.15213 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Populism Meets AI: Advancing Populism Research with LLMs
by: Jung, Yujin J., et al.
Published: (2025)
by: Jung, Yujin J., et al.
Published: (2025)
Unpacking Populist Secessionism: Elite Discourse and Mass Attitudes in Republika Srpska, Bosnia and Herzegovina
by: Semir Dzebo
Published: (2025)
by: Semir Dzebo
Published: (2025)
Transferable speech-to-text large language model alignment module
by: Wu, Boyong, et al.
Published: (2024)
by: Wu, Boyong, et al.
Published: (2024)
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
by: Lyth, Dan, et al.
Published: (2024)
by: Lyth, Dan, et al.
Published: (2024)
Code-switching in text and speech challenges information-theoretic speaker design
by: Bhattacharya, Debasmita, et al.
Published: (2024)
by: Bhattacharya, Debasmita, et al.
Published: (2024)
Prominence-aware automatic speech recognition for conversational speech
by: Linke, Julian, et al.
Published: (2025)
by: Linke, Julian, et al.
Published: (2025)
A thorough benchmark of automatic text classification: From traditional approaches to large language models
by: Cunha, Washington, et al.
Published: (2025)
by: Cunha, Washington, et al.
Published: (2025)
Red and blue language: Word choices in the Trump & Harris 2024 presidential debate
by: Wicke, Philipp, et al.
Published: (2024)
by: Wicke, Philipp, et al.
Published: (2024)
Extracting chemical food safety hazards from the scientific literature automatically using large language models
by: Özen, Neris, et al.
Published: (2024)
by: Özen, Neris, et al.
Published: (2024)
Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition?
by: Araiza-Illan, Gloria, et al.
Published: (2023)
by: Araiza-Illan, Gloria, et al.
Published: (2023)
End-to-end Speech Recognition with similar length speech and text
by: Fan, Peng, et al.
Published: (2025)
by: Fan, Peng, et al.
Published: (2025)
A study on the impact of Self-Supervised Learning on automatic dysarthric speech assessment
by: Cadet, Xavier F., et al.
Published: (2023)
by: Cadet, Xavier F., et al.
Published: (2023)
Language-agnostic, automated assessment of listeners' speech recall using large language models
by: Herrmann, Björn
Published: (2025)
by: Herrmann, Björn
Published: (2025)
Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
by: Wright, George August, et al.
Published: (2023)
by: Wright, George August, et al.
Published: (2023)
An efficient text augmentation approach for contextualized Mandarin speech recognition
by: Zheng, Naijun, et al.
Published: (2024)
by: Zheng, Naijun, et al.
Published: (2024)
Spatio-temporal transformer to support automatic sign language translation
by: Ruiz, Christian, et al.
Published: (2025)
by: Ruiz, Christian, et al.
Published: (2025)
What makes a good metric? Evaluating automatic metrics for text-to-image consistency
by: Ross, Candace, et al.
Published: (2024)
by: Ross, Candace, et al.
Published: (2024)
Break Out the Silverware -- Semantic Understanding of Stored Household Items
by: Levi-Richter, Michaela, et al.
Published: (2025)
by: Levi-Richter, Michaela, et al.
Published: (2025)
Understanding the effects of language-specific class imbalance in multilingual fine-tuning
by: Jung, Vincent, et al.
Published: (2024)
by: Jung, Vincent, et al.
Published: (2024)
Synthetically generated text for supervised text analysis
by: Halterman, Andrew
Published: (2023)
by: Halterman, Andrew
Published: (2023)
Integrating automatic speech recognition into remote healthcare interpreting: A pilot study of its impact on interpreting quality
by: Tan, Shiyi, et al.
Published: (2025)
by: Tan, Shiyi, et al.
Published: (2025)
The evaluation of a code-switched Sepedi-English automatic speech recognition system
by: Phaladi, Amanda, et al.
Published: (2024)
by: Phaladi, Amanda, et al.
Published: (2024)
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet
by: Dhakal, Manish, et al.
Published: (2024)
by: Dhakal, Manish, et al.
Published: (2024)
RadEval: A framework for radiology text evaluation
by: Xu, Justin, et al.
Published: (2025)
by: Xu, Justin, et al.
Published: (2025)
PashtoTTS-Bench: automated screening for low-resource non-Latin-script text-to-speech
by: Rahman, Hanif
Published: (2026)
by: Rahman, Hanif
Published: (2026)
Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)
by: Miller, Justin K., et al.
Published: (2024)
Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters
by: Fujita, Kenichi, et al.
Published: (2024)
by: Fujita, Kenichi, et al.
Published: (2024)
Large language models struggle with ethnographic text annotation
by: Goodall, Leonardo S., et al.
Published: (2026)
by: Goodall, Leonardo S., et al.
Published: (2026)
AugSumm: towards generalizable speech summarization using synthetic labels from large language model
by: Jung, Jee-weon, et al.
Published: (2024)
by: Jung, Jee-weon, et al.
Published: (2024)
Data-driven grapheme-to-phoneme representations for a lexicon-free text-to-speech
by: Garg, Abhinav, et al.
Published: (2024)
by: Garg, Abhinav, et al.
Published: (2024)
Can large audio language models understand child stuttering speech? speech summarization, and source separation
by: Okocha, Chibuzor, et al.
Published: (2025)
by: Okocha, Chibuzor, et al.
Published: (2025)
A unified front-end framework for English text-to-speech synthesis
by: Ying, Zelin, et al.
Published: (2023)
by: Ying, Zelin, et al.
Published: (2023)
Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance
by: Petersen, Molly R., et al.
Published: (2023)
by: Petersen, Molly R., et al.
Published: (2023)
Perspectives on goal setting: Video‐reflexive ethnography with speech–language therapists and clients
by: Laurien Brauner, et al.
Published: (2024)
by: Laurien Brauner, et al.
Published: (2024)
Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics
by: Kocbek, Primoz, et al.
Published: (2025)
by: Kocbek, Primoz, et al.
Published: (2025)
Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)
by: Rauba, Paulius, et al.
Published: (2025)
Impact of automatic speech recognition quality on Alzheimer's disease detection from spontaneous speech: a reproducible benchmark study with lexical modeling and statistical validation
by: Samanta, Himadri S
Published: (2026)
by: Samanta, Himadri S
Published: (2026)
Differentially-private text generation degrades output language quality
by: Çano, Erion, et al.
Published: (2025)
by: Çano, Erion, et al.
Published: (2025)
Machine-generated text detection prevents language model collapse
by: Drayson, George, et al.
Published: (2025)
by: Drayson, George, et al.
Published: (2025)
Detecting out-of-distribution text using topological features of transformer-based language models
by: Pollano, Andres, et al.
Published: (2023)
by: Pollano, Andres, et al.
Published: (2023)
Similar Items
-
Populism Meets AI: Advancing Populism Research with LLMs
by: Jung, Yujin J., et al.
Published: (2025) -
Unpacking Populist Secessionism: Elite Discourse and Mass Attitudes in Republika Srpska, Bosnia and Herzegovina
by: Semir Dzebo
Published: (2025) -
Transferable speech-to-text large language model alignment module
by: Wu, Boyong, et al.
Published: (2024) -
Natural language guidance of high-fidelity text-to-speech with synthetic annotations
by: Lyth, Dan, et al.
Published: (2024) -
Code-switching in text and speech challenges information-theoretic speaker design
by: Bhattacharya, Debasmita, et al.
Published: (2024)