| Main Authors: | Arias, Esteban Garces, Rodemann, Julian, Heumann, Christian |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.23088 |
Similar Items
Adaptive Contrastive Search: Uncertainty-Guided Decoding for Open-Ended Text Generation
by: Arias, Esteban Garces, et al.
Published: (2024)
Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework
by: Arias, Esteban Garces, et al.
Published: (2024)
The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices
by: Arias, Esteban Garces, et al.
Published: (2026)
GUARD: Glocal Uncertainty-Aware Robust Decoding for Effective and Efficient Open-Ended Text Generation
by: Ding, Yuanhao, et al.
Published: (2025)
Statistical Multicriteria Evaluation of LLM-Generated Text
by: Arias, Esteban Garces, et al.
Published: (2025)
Decoding Decoded: Understanding Hyperparameter Effects in Open-Ended Text Generation
by: Arias, Esteban Garces, et al.
Published: (2024)
Beyond Temperature: Hyperfitting as a Late-Stage Geometric Expansion
by: Li, Meimingwei, et al.
Published: (2026)
A Statistical Case Against Empirical Human-AI Alignment
by: Rodemann, Julian, et al.
Published: (2025)
Modern Models, Medieval Texts: A POS Tagging Study of Old Occitan
by: Schöffel, Matthias, et al.
Published: (2025)
Unveiling Factors for Enhanced POS Tagging: A Study of Low-Resource Medieval Romance Languages
by: Schöffel, Matthias, et al.
Published: (2025)
Min-$k$ Sampling: Decoupling Truncation from Temperature Scaling via Relative Logit Dynamics
by: Ding, Yuanhao, et al.
Published: (2026)
From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages
by: Schöffel, Matthias, et al.
Published: (2026)
Self-Reinforcing Controllable Synthesis of Rare Relational Data via Bayesian Calibration
by: Zhang, Chongsheng, et al.
Published: (2026)
Can OpenSource beat ChatGPT? -- A Comparative Study of Large Language Models for Text-to-Code Generation
by: Mayer, Luis, et al.
Published: (2024)
BrainBench: Exposing the Commonsense Reasoning Gap in Large Language Models
by: Tang, Yuzhe
Published: (2026)
Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models
by: Ji, Shihao, et al.
Published: (2025)
Geometry-Calibrated Conformal Abstention for Language Models
by: Xu, Rui, et al.
Published: (2026)
Incentive Aware AI Regulations: A Credal Characterisation
by: Singh, Anurag, et al.
Published: (2026)
How Prevalent is Gender Bias in ChatGPT? -- Exploring German and English ChatGPT Responses
by: Urchs, Stefanie, et al.
Published: (2023)
Exposing Numeracy Gaps: A Benchmark to Evaluate Fundamental Numerical Abilities in Large Language Models
by: Li, Haoyang, et al.
Published: (2025)
Lost in Translation? Exploring the Shift in Grammatical Gender from Latin to Occitan
by: Chatterjee, Ahan, et al.
Published: (2026)
How Creative Are Large Language Models in Generating Molecules?
by: Tao, Wen, et al.
Published: (2026)
Theory-Grounded Evaluation Exposes the Authorship Gap in LLM Personalization
by: Sawant, Yash Ganpat
Published: (2026)
How do Humans and Language Models Reason About Creativity? A Comparative Analysis
by: Laverghetta Jr., Antonio, et al.
Published: (2025)
Mind the Confidence Gap: Overconfidence, Calibration, and Distractor Effects in Large Language Models
by: Chhikara, Prateek
Published: (2025)
Fair Play in the Newsroom: Actor-Based Filtering Gender Discrimination in Text Corpora
by: Urchs, Stefanie, et al.
Published: (2025)
taz2024full: Analysing German Newspapers for Gender Bias and Discrimination across Decades
by: Urchs, Stefanie, et al.
Published: (2025)
Creativity Bias: How Machine Evaluation Struggles with Creativity in Literary Translations
by: Gerrits, Kyo, et al.
Published: (2026)
On the Creativity of Large Language Models
by: Franceschelli, Giorgio, et al.
Published: (2023)
Beyond Divergent Creativity: A Human-Based Evaluation of Creativity in Large Language Models
by: Nakajima, Kumiko, et al.
Published: (2026)
Bridging the Missing-Modality Gap: Improving Text-Only Calibration of Vision Language Models
by: Kim, Mingyeong, et al.
Published: (2026)
CreativityPrism: A Holistic Evaluation Framework for Large Language Model Creativity
by: Hou, Zhaoyi Joey, et al.
Published: (2025)
Fill In The Gaps: Model Calibration and Generalization with Synthetic Data
by: Ba, Yang, et al.
Published: (2024)
Calibrated Surprise: An Information-Theoretic Account of Creative Quality
by: Zou, Bo, et al.
Published: (2026)
How Small Transformation Expose the Weakness of Semantic Similarity Measures
by: Nikiema, Serge Lionel, et al.
Published: (2025)
Task Calibration: Calibrating Large Language Models on Inference Tasks
by: Li, Yingjie, et al.
Published: (2024)
How Language Directions Align with Token Geometry in Multilingual LLMs
by: Kim, JaeSeong, et al.
Published: (2025)
KG-RAG: Bridging the Gap Between Knowledge and Creativity
by: Sanmartin, Diego
Published: (2024)
The Dark Patterns of Personalized Persuasion in Large Language Models: Exposing Persuasive Linguistic Features for Big Five Personality Traits in LLMs Responses
by: Mieleszczenko-Kowszewicz, Wiktoria, et al.
Published: (2024)
The AI Gap: How Socioeconomic Status Affects Language Technology Interactions
by: Bassignana, Elisa, et al.
Published: (2025)