:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Ackerman, Christopher
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.21545
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind
by: Ackerman, Christopher
Published: (2026)

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
by: Didolkar, Aniket, et al.
Published: (2024)

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
by: Ackerman, Christopher, et al.
Published: (2024)

Mitigating Many-Shot Jailbreaking
by: Ackerman, Christopher M., et al.
Published: (2025)

Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
by: Zhou, Huilin, et al.
Published: (2026)

Knowledge-Centric Metacognitive Learning
by: Kumar, Arun, et al.
Published: (2024)

The Metacognitive Probe: Five Behavioural Calibration Diagnostics for LLMs
by: Oliveira, Rafael C. T.
Published: (2026)

Cog-Rethinker: Hierarchical Metacognitive Reinforcement Learning for LLM Reasoning
by: Sun, Zexu, et al.
Published: (2025)

Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors
by: Didolkar, Aniket, et al.
Published: (2025)

Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments (MUSE)
by: Valiente, Rodolfo, et al.
Published: (2024)

MIRROR: A Hierarchical Benchmark for Metacognitive Calibration in Large Language Models
by: Wang, Jason Z
Published: (2026)

SCI: A Metacognitive Control for Signal Dynamics
by: Meesala, Vishal Joshua
Published: (2025)

Limits of PRM-Guided Tree Search for Mathematical Reasoning with LLMs
by: Cinquin, Tristan, et al.
Published: (2025)

Robustness is Important: Limitations of LLMs for Data Fitting
by: Liu, Hejia, et al.
Published: (2025)

When the Loop Closes: Architectural Limits of In-Context Isolation, Metacognitive Co-option, and the Two-Target Design Problem in Human-LLM Systems
by: Cheng, Z., et al.
Published: (2026)

Comprehension Without Competence: Architectural Limits of LLMs in Symbolic Computation and Reasoning
by: Zhang, Zheng
Published: (2025)

Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks
by: Torres, Dorothy, et al.
Published: (2026)

Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models
by: Tan, Lit Sin, et al.
Published: (2026)

A Fragile Number Sense: Probing the Elemental Limits of Numerical Reasoning in LLMs
by: Rahman, Roussel, et al.
Published: (2025)

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
by: Lin, Bill Yuchen, et al.
Published: (2025)

State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence
by: Aviss, Thea
Published: (2025)

A Practical Guide for Evaluating LLMs and LLM-Reliant Systems
by: Rudd, Ethan M., et al.
Published: (2025)

BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
by: Huang, Wei, et al.
Published: (2024)

The Compliance Trap: How Structural Constraints Degrade Frontier AI Metacognition Under Adversarial Pressure
by: Kumar, Rahul
Published: (2026)

Hypertokens: Holographic Associative Memory in Tokenized LLMs
by: Augeri, Christopher James
Published: (2025)

Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs
by: Billa, Jayadev
Published: (2026)

Data Distribution as a Lever for Guiding Optimizers Toward Superior Generalization in LLMs
by: Gangavarapu, Tushaar, et al.
Published: (2026)

seqBench: A Tunable Benchmark to Quantify Sequential Reasoning Limits of LLMs
by: Ramezanali, Mohammad, et al.
Published: (2025)

Dialogue Without Limits: Constant-Sized KV Caches for Extended Responses in LLMs
by: Ghadia, Ravi, et al.
Published: (2025)

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
by: Duan, Jinhao, et al.
Published: (2024)

Compute-Optimal LLMs Provably Generalize Better With Scale
by: Finzi, Marc, et al.
Published: (2025)

LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law
by: Liu, Toni J. B., et al.
Published: (2024)

ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
by: Yin, Junjie, et al.
Published: (2023)

AutoOR: Scalably Post-training LLMs to Autoformalize Operations Research Problems
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)

Early Evidence of Vibe-Proving with Consumer LLMs: A Case Study on Spectral Region Characterization with ChatGPT-5.2 (Thinking)
by: Verbeken, Brecht, et al.
Published: (2026)

On Limitations of the Transformer Architecture
by: Peng, Binghui, et al.
Published: (2024)

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
by: Guo, Wentao, et al.
Published: (2024)

LLMs Judging LLMs: A Simplex Perspective
by: Vossler, Patrick, et al.
Published: (2025)

On Limitation of Transformer for Learning HMMs
by: Hu, Jiachen, et al.
Published: (2024)

ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking
by: Hu, Miaobo, et al.
Published: (2026)