:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Cacioli, Jon-Paul
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2604.22215
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Distilling Self-Consistency into Verbal Confidence: A Pre-Registered Negative Result and Post-Hoc Rescue on Gemma 3 4B
by: Cacioli, Jon-Paul
Published: (2026)

Screen Before You Interpret: A Portable Validity Protocol for Benchmark-Based LLM Confidence Signals
by: Cacioli, Jon-Paul
Published: (2026)

Concurrent Criterion Validation of a Validity Screen for LLM Confidence Signals via Selective Prediction
by: Cacioli, Jon-Paul
Published: (2026)

Cross-Entropy Is Load-Bearing: A Pre-Registered Scope Test of the K-Way Energy Probe on Bidirectional Predictive Coding
by: Cacioli, Jon-Paul
Published: (2026)

Before You Interpret the Profile: Validity Scaling for LLM Metacognitive Self-Report
by: Cacioli, Jon-Paul
Published: (2026)

Instruction Complexity Induces Positional Collapse in Adversarial LLM Evaluation
by: Cacioli, Jon-Paul
Published: (2026)

Domain-level metacognitive monitoring in frontier LLMs: A 33-model atlas
by: Cacioli, Jon-Paul
Published: (2026)

LLMs as Signal Detectors: Sensitivity, Bias, and the Temperature-Criterion Analogy
by: Cacioli, Jon-Paul
Published: (2026)

Below-Chance Blindness: Prompted Underperformance in Small LLMs Produces Positional Bias Rather than Answer Avoidance
by: Cacioli, Jon-Paul
Published: (2026)

Do LLMs Know What They Know? Measuring Metacognitive Efficiency with Signal Detection Theory
by: Cacioli, Jon-Paul
Published: (2026)

Weber's Law in Transformer Magnitude Representations: Efficient Coding, Representational Geometry, and Psychophysical Laws in Language Models
by: Cacioli, Jon-Paul
Published: (2026)

Beyond the Mean: Within-Model Reliable Change Detection for LLM Evaluation
by: Cacioli, Jon-Paul
Published: (2026)

Option-Order Randomisation Reveals a Distributional Position Attractor in Prompted Sandbagging
by: Cacioli, Jon-Paul
Published: (2026)

Exemplar Retrieval Without Overhypothesis Induction: Limits of Distributional Sequence Learning in Early Word Learning
by: Cacioli, Jon-Paul
Published: (2026)

Categorical Perception in Large Language Model Hidden States: Structural Warping at Digit-Count Boundaries
by: Cacioli, Jon-Paul
Published: (2026)

How do LLMs Compute Verbal Confidence
by: Kumaran, Dharshan, et al.
Published: (2026)

Calibrating Verbalized Confidence with Self-Generated Distractors
by: Wang, Victor, et al.
Published: (2025)

Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning
by: Cai, Hongyi, et al.
Published: (2025)

ConfTuner: Training Large Language Models to Express Their Confidence Verbally
by: Li, Yibo, et al.
Published: (2025)

LoVeC: Reinforcement Learning for Better Verbalized Confidence in Long-Form Generations
by: Zhang, Caiqi, et al.
Published: (2025)

Verbalized Confidence Triggers Self-Verification: Emergent Behavior Without Explicit Reasoning Supervision
by: Jang, Chaeyun, et al.
Published: (2025)

Quantisation Reshapes the Metacognitive Geometry of Language Models
by: Cacioli, Jon-Paul
Published: (2026)

Verbalizing LLMs' assumptions to explain and control sycophancy
by: Cheng, Myra, et al.
Published: (2026)

Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
by: Singh, Shivalika, et al.
Published: (2024)

The Metacognitive Monitoring Battery: A Cross-Domain Benchmark for LLM Self-Monitoring
by: Cacioli, Jon-Paul
Published: (2026)

GemmAr: Enhancing LLMs Through Arabic Instruction-Tuning
by: Chouikhi, Hasna, et al.
Published: (2024)

How Reliable Are Automatic Evaluation Methods for Instruction-Tuned LLMs?
by: Doostmohammadi, Ehsan, et al.
Published: (2024)

A Comparative Analysis of Instruction Fine-Tuning LLMs for Financial Text Classification
by: Fatemi, Sorouralsadat, et al.
Published: (2024)

Building Accurate Translation-Tailored LLMs with Language Aware Instruction Tuning
by: Zan, Changtong, et al.
Published: (2024)

Same Geometry, Opposite Noise: Transformer Magnitude Representations Lack Scalar Variability
by: Cacioli, Jon-Paul
Published: (2026)

A Modular Approach for Clinical SLMs Driven by Synthetic Data with Pre-Instruction Tuning, Model Merging, and Clinical-Tasks Alignment
by: Corbeil, Jean-Philippe, et al.
Published: (2025)

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian
by: Nikolich, Aleksandr, et al.
Published: (2024)

TACOS: Open Tagging and Comparative Scoring for Instruction Fine-Tuning Data Selection
by: He, Xixiang, et al.
Published: (2025)

Instruction Tuning With Loss Over Instructions
by: Shi, Zhengyan, et al.
Published: (2024)

Teaching According to Talents! Instruction Tuning LLMs with Competence-Aware Curriculum Learning
by: Li, Yangning, et al.
Published: (2025)

Teaching LLMs to Plan: Logical Chain-of-Thought Instruction Tuning for Symbolic Planning
by: Verma, Pulkit, et al.
Published: (2025)

Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting
by: Luo, Yifan, et al.
Published: (2024)

Tamper-Resistant Safeguards for Open-Weight LLMs
by: Tamirisa, Rishub, et al.
Published: (2024)

Psychometric Item Validation Using Virtual Respondents with Trait-Response Mediators
by: Lim, Sungjib, et al.
Published: (2025)

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs
by: Li, Wu, et al.
Published: (2026)