:: Library Catalog

Bibliographic Details
Main Authors:	Wennberg, Ulme; Henter, Gustav Eje
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2404.16574

Similar Items

CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
by: Tu, Ruibo, et al.
Published: (2024)

VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency
by: Torgashov, Nikita, et al.
Published: (2025)

VoXtream2: Full-stream TTS with dynamic speaking rate control
by: Torgashov, Nikita, et al.
Published: (2026)

Do Bias Benchmarks Generalise? Evidence from Voice-based Evaluation of Gender Bias in SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)

Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)

The Voice Behind the Words: Quantifying Intersectional Bias in SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2026)

Matcha-TTS: A fast TTS architecture with conditional flow matching
by: Mehta, Shivam, et al.
Published: (2023)

When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)

EmojiVoice: Towards long-term controllable expressivity in robot speech
by: Tuttösí, Paige, et al.
Published: (2025)

Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models
by: Veerendranath, Vishruth, et al.
Published: (2024)

The Effect of Scripts and Formats on LLM Numeracy
by: Reddy, Varshini, et al.
Published: (2026)

How Robust Are Large Language Models for Clinical Numeracy? An Empirical Study on Numerical Reasoning Abilities in Clinical Contexts
by: Nguyen, Minh-Vuong, et al.
Published: (2026)

Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech
by: Mehta, Shivam, et al.
Published: (2024)

Unified speech and gesture synthesis using flow matching
by: Mehta, Shivam, et al.
Published: (2023)

Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models
by: Li, Haoyang, et al.
Published: (2025)

Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models
by: Deng, Ningyuan, et al.
Published: (2025)

Evaluating gesture generation in a large-scale open challenge: The GENEA Challenge 2022
by: Kucherenko, Taras, et al.
Published: (2023)

Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis
by: Mehta, Shivam, et al.
Published: (2024)

Voice Conversion-based Privacy through Adversarial Information Hiding
by: Webber, Jacob J, et al.
Published: (2024)

Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework
by: Tu, Ruibo, et al.
Published: (2024)

TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)

Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation
by: Laiyk, Nurkhan, et al.
Published: (2026)

Can Language Model Understand Word Semantics as A Chatbot? An Empirical Study of Language Model Internal External Mismatch
by: Zhao, Jinman, et al.
Published: (2024)

Large Language Models Are Human-Like Internally
by: Kuribayashi, Tatsuki, et al.
Published: (2025)

ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights
by: Santosh, T. Y. S. S, et al.
Published: (2024)

NILE: Internal Consistency Alignment in Large Language Models
by: Hu, Minda, et al.
Published: (2024)

Language Mixing in Reasoning Language Models: Patterns, Impact, and Internal Causes
by: Wang, Mingyang, et al.
Published: (2025)

Internal Consistency and Self-Feedback in Large Language Models: A Survey
by: Liang, Xun, et al.
Published: (2024)

The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)

Calibrating Reasoning in Language Models with Internal Consistency
by: Xie, Zhihui, et al.
Published: (2024)

HiFi-Glot: High-Fidelity Neural Formant Synthesis with Differentiable Resonant Filters
by: Gu, Yicheng, et al.
Published: (2024)

Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
by: Nagy, Rajmund, et al.
Published: (2024)

Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
by: Ghosh, Poulami, et al.
Published: (2024)

Benchmarking Language Model Creativity: A Case Study on Code Generation
by: Lu, Yining, et al.
Published: (2024)

Adapting Definition Modeling for New Languages: A Case Study on Belarusian
by: Kazakouskaya, Daniela, et al.
Published: (2025)

Prompt-Guided Internal States for Hallucination Detection of Large Language Models
by: Zhang, Fujie, et al.
Published: (2024)

UCoder: Unsupervised Code Generation by Internal Probing of Large Language Models
by: Wu, Jiajun, et al.
Published: (2025)

Probing Internal Representations of Multi-Word Verbs in Large Language Models
by: Kissane, Hassane, et al.
Published: (2025)

The Geometry of Tokens in Internal Representations of Large Language Models
by: Viswanathan, Karthik, et al.
Published: (2025)

Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference
by: Proebsting, Grace, et al.
Published: (2025)