Saved in:
| Main Authors: | Wennberg, Ulme, Henter, Gustav Eje |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.16574 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
by: Tu, Ruibo, et al.
Published: (2024)
by: Tu, Ruibo, et al.
Published: (2024)
VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency
by: Torgashov, Nikita, et al.
Published: (2025)
by: Torgashov, Nikita, et al.
Published: (2025)
VoXtream2: Full-stream TTS with dynamic speaking rate control
by: Torgashov, Nikita, et al.
Published: (2026)
by: Torgashov, Nikita, et al.
Published: (2026)
Do Bias Benchmarks Generalise? Evidence from Voice-based Evaluation of Gender Bias in SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)
Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)
The Voice Behind the Words: Quantifying Intersectional Bias in SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2026)
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2026)
Matcha-TTS: A fast TTS architecture with conditional flow matching
by: Mehta, Shivam, et al.
Published: (2023)
by: Mehta, Shivam, et al.
Published: (2023)
When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)
EmojiVoice: Towards long-term controllable expressivity in robot speech
by: Tuttösí, Paige, et al.
Published: (2025)
by: Tuttösí, Paige, et al.
Published: (2025)
Pre-Calc: Learning to Use the Calculator Improves Numeracy in Language Models
by: Veerendranath, Vishruth, et al.
Published: (2024)
by: Veerendranath, Vishruth, et al.
Published: (2024)
The Effect of Scripts and Formats on LLM Numeracy
by: Reddy, Varshini, et al.
Published: (2026)
by: Reddy, Varshini, et al.
Published: (2026)
How Robust Are Large Language Models for Clinical Numeracy? An Empirical Study on Numerical Reasoning Abilities in Clinical Contexts
by: Nguyen, Minh-Vuong, et al.
Published: (2026)
by: Nguyen, Minh-Vuong, et al.
Published: (2026)
Should you use a probabilistic duration model in TTS? Probably! Especially for spontaneous speech
by: Mehta, Shivam, et al.
Published: (2024)
by: Mehta, Shivam, et al.
Published: (2024)
Unified speech and gesture synthesis using flow matching
by: Mehta, Shivam, et al.
Published: (2023)
by: Mehta, Shivam, et al.
Published: (2023)
Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models
by: Li, Haoyang, et al.
Published: (2025)
by: Li, Haoyang, et al.
Published: (2025)
Revealing the Numeracy Gap: An Empirical Investigation of Text Embedding Models
by: Deng, Ningyuan, et al.
Published: (2025)
by: Deng, Ningyuan, et al.
Published: (2025)
Evaluating gesture generation in a large-scale open challenge: The GENEA Challenge 2022
by: Kucherenko, Taras, et al.
Published: (2023)
by: Kucherenko, Taras, et al.
Published: (2023)
Fake it to make it: Using synthetic data to remedy the data shortage in joint multimodal speech-and-gesture synthesis
by: Mehta, Shivam, et al.
Published: (2024)
by: Mehta, Shivam, et al.
Published: (2024)
Voice Conversion-based Privacy through Adversarial Information Hiding
by: Webber, Jacob J, et al.
Published: (2024)
by: Webber, Jacob J, et al.
Published: (2024)
Causality for Tabular Data Synthesis: A High-Order Structure Causal Benchmark Framework
by: Tu, Ruibo, et al.
Published: (2024)
by: Tu, Ruibo, et al.
Published: (2024)
TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)
by: Xu, Qiancheng, et al.
Published: (2026)
Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation
by: Laiyk, Nurkhan, et al.
Published: (2026)
by: Laiyk, Nurkhan, et al.
Published: (2026)
Can Language Model Understand Word Semantics as A Chatbot? An Empirical Study of Language Model Internal External Mismatch
by: Zhao, Jinman, et al.
Published: (2024)
by: Zhao, Jinman, et al.
Published: (2024)
Large Language Models Are Human-Like Internally
by: Kuribayashi, Tatsuki, et al.
Published: (2025)
by: Kuribayashi, Tatsuki, et al.
Published: (2025)
ECtHR-PCR: A Dataset for Precedent Understanding and Prior Case Retrieval in the European Court of Human Rights
by: Santosh, T. Y. S. S, et al.
Published: (2024)
by: Santosh, T. Y. S. S, et al.
Published: (2024)
NILE: Internal Consistency Alignment in Large Language Models
by: Hu, Minda, et al.
Published: (2024)
by: Hu, Minda, et al.
Published: (2024)
Language Mixing in Reasoning Language Models: Patterns, Impact, and Internal Causes
by: Wang, Mingyang, et al.
Published: (2025)
by: Wang, Mingyang, et al.
Published: (2025)
Internal Consistency and Self-Feedback in Large Language Models: A Survey
by: Liang, Xun, et al.
Published: (2024)
by: Liang, Xun, et al.
Published: (2024)
The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)
by: Newman, Benjamin, et al.
Published: (2025)
Calibrating Reasoning in Language Models with Internal Consistency
by: Xie, Zhihui, et al.
Published: (2024)
by: Xie, Zhihui, et al.
Published: (2024)
HiFi-Glot: High-Fidelity Neural Formant Synthesis with Differentiable Resonant Filters
by: Gu, Yicheng, et al.
Published: (2024)
by: Gu, Yicheng, et al.
Published: (2024)
Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis
by: Nagy, Rajmund, et al.
Published: (2024)
by: Nagy, Rajmund, et al.
Published: (2024)
Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
by: Ghosh, Poulami, et al.
Published: (2024)
by: Ghosh, Poulami, et al.
Published: (2024)
Benchmarking Language Model Creativity: A Case Study on Code Generation
by: Lu, Yining, et al.
Published: (2024)
by: Lu, Yining, et al.
Published: (2024)
Adapting Definition Modeling for New Languages: A Case Study on Belarusian
by: Kazakouskaya, Daniela, et al.
Published: (2025)
by: Kazakouskaya, Daniela, et al.
Published: (2025)
Prompt-Guided Internal States for Hallucination Detection of Large Language Models
by: Zhang, Fujie, et al.
Published: (2024)
by: Zhang, Fujie, et al.
Published: (2024)
UCoder: Unsupervised Code Generation by Internal Probing of Large Language Models
by: Wu, Jiajun, et al.
Published: (2025)
by: Wu, Jiajun, et al.
Published: (2025)
Probing Internal Representations of Multi-Word Verbs in Large Language Models
by: Kissane, Hassane, et al.
Published: (2025)
by: Kissane, Hassane, et al.
Published: (2025)
The Geometry of Tokens in Internal Representations of Large Language Models
by: Viswanathan, Karthik, et al.
Published: (2025)
by: Viswanathan, Karthik, et al.
Published: (2025)
Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference
by: Proebsting, Grace, et al.
Published: (2025)
by: Proebsting, Grace, et al.
Published: (2025)
Similar Items
-
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models
by: Tu, Ruibo, et al.
Published: (2024) -
VoXtream: Full-Stream Text-to-Speech with Extremely Low Latency
by: Torgashov, Nikita, et al.
Published: (2025) -
VoXtream2: Full-stream TTS with dynamic speaking rate control
by: Torgashov, Nikita, et al.
Published: (2026) -
Do Bias Benchmarks Generalise? Evidence from Voice-based Evaluation of Gender Bias in SpeechLLMs
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025) -
Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias
by: Satish, Shree Harsha Bokkahalli, et al.
Published: (2025)