:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Arias, Esteban Garces, Rodemann, Julian, Heumann, Christian
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2509.23088
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation
by: Arias, Esteban Garces, et al.
Published: (2024)

Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework
by: Arias, Esteban Garces, et al.
Published: (2024)

The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices
by: Arias, Esteban Garces, et al.
Published: (2026)

GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation
by: Ding, Yuanhao, et al.
Published: (2025)

Statistical Multicriteria Evaluation of LLM-Generated Text
by: Arias, Esteban Garces, et al.
Published: (2025)

Decoding Decoded: Understanding Hyperparameter Effects in Open-Ended Text Generation
by: Arias, Esteban Garces, et al.
Published: (2024)

Beyond Temperature: Hyperfitting as a Late-Stage Geometric Expansion
by: Li, Meimingwei, et al.
Published: (2026)

A Statistical Case Against Empirical Human-AI Alignment
by: Rodemann, Julian, et al.
Published: (2025)

Modern Models, Medieval Texts: A POS Tagging Study of Old Occitan
by: Schöffel, Matthias, et al.
Published: (2025)

Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages
by: Schöffel, Matthias, et al.
Published: (2025)

Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics
by: Ding, Yuanhao, et al.
Published: (2026)

From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages
by: Schöffel, Matthias, et al.
Published: (2026)

Self-Reinforcing Controllable Synthesis of Rare Relational Data via Bayesian Calibration
by: Zhang, Chongsheng, et al.
Published: (2026)

Can OpenSource beat ChatGPT? -- A Comparative Study of Large Language Models for Text-to-Code Generation
by: Mayer, Luis, et al.
Published: (2024)

BrainBench: Exposing the Commonsense Reasoning Gap in Large Language Models
by: Tang, Yuzhe
Published: (2026)

Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models
by: Ji, Shihao, et al.
Published: (2025)

Geometry-Calibrated Conformal Abstention for Language Models
by: Xu, Rui, et al.
Published: (2026)

Incentive Aware AI Regulations: A Credal Characterisation
by: Singh, Anurag, et al.
Published: (2026)

How Prevalent is Gender Bias in ChatGPT? -- Exploring German and English ChatGPT Responses
by: Urchs, Stefanie, et al.
Published: (2023)

Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models
by: Li, Haoyang, et al.
Published: (2025)

Lost in Translation? Exploring the Shift in Grammatical Gender from Latin to Occitan
by: Chatterjee, Ahan, et al.
Published: (2026)

How Creative Are Large Language Models in Generating Molecules?
by: Tao, Wen, et al.
Published: (2026)

Theory-Grounded Evaluation Exposes the Authorship Gap in LLM Personalization
by: Sawant, Yash Ganpat
Published: (2026)

How do Humans and Language Models Reason About Creativity? A Comparative Analysis
by: Laverghetta Jr., Antonio, et al.
Published: (2025)

Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models
by: Chhikara, Prateek
Published: (2025)

Fair Play in the Newsroom: Actor-Based Filtering Gender Discrimination in Text Corpora
by: Urchs, Stefanie, et al.
Published: (2025)

taz2024full: Analysing German Newspapers for Gender Bias and Discrimination across Decades
by: Urchs, Stefanie, et al.
Published: (2025)

Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations
by: Gerrits, Kyo, et al.
Published: (2026)

On the Creativity of Large Language Models
by: Franceschelli, Giorgio, et al.
Published: (2023)

Beyond Divergent Creativity: A Human-Based Evaluation of Creativity in Large Language Models
by: Nakajima, Kumiko, et al.
Published: (2026)

Bridging the Missing-Modality Gap: Improving Text-Only Calibration of Vision Language Models
by: Kim, Mingyeong, et al.
Published: (2026)

CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity
by: Hou, Zhaoyi Joey, et al.
Published: (2025)

Fill In The Gaps: Model Calibration and Generalization with Synthetic Data
by: Ba, Yang, et al.
Published: (2024)

Calibrated Surprise: An Information-Theoretic Account of Creative Quality
by: Zou, Bo, et al.
Published: (2026)

How Small Transformation Expose the Weakness of Semantic Similarity Measures
by: Nikiema, Serge Lionel, et al.
Published: (2025)

Task Calibration: Calibrating Large Language Models on Inference Tasks
by: Li, Yingjie, et al.
Published: (2024)

How Language Directions Align with Token Geometry in Multilingual LLMs
by: Kim, JaeSeong, et al.
Published: (2025)

KG-RAG: Bridging the Gap Between Knowledge and Creativity
by: Sanmartin, Diego
Published: (2024)

The Dark Patterns of Personalized Persuasion in Large Language Models: Exposing Persuasive Linguistic Features for Big Five Personality Traits in LLMs Responses
by: Mieleszczenko-Kowszewicz, Wiktoria, et al.
Published: (2024)

The AI Gap: How Socioeconomic Status Affects Language Technology Interactions
by: Bassignana, Elisa, et al.
Published: (2025)