Saved in:
| Main Author: | Ackerman, Christopher |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.21545 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind
by: Ackerman, Christopher
Published: (2026)
by: Ackerman, Christopher
Published: (2026)
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
by: Didolkar, Aniket, et al.
Published: (2024)
by: Didolkar, Aniket, et al.
Published: (2024)
Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
by: Ackerman, Christopher, et al.
Published: (2024)
by: Ackerman, Christopher, et al.
Published: (2024)
Mitigating Many-Shot Jailbreaking
by: Ackerman, Christopher M., et al.
Published: (2025)
by: Ackerman, Christopher M., et al.
Published: (2025)
Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
by: Zhou, Huilin, et al.
Published: (2026)
by: Zhou, Huilin, et al.
Published: (2026)
Knowledge-Centric Metacognitive Learning
by: Kumar, Arun, et al.
Published: (2024)
by: Kumar, Arun, et al.
Published: (2024)
The Metacognitive Probe: Five Behavioural Calibration Diagnostics for LLMs
by: Oliveira, Rafael C. T.
Published: (2026)
by: Oliveira, Rafael C. T.
Published: (2026)
Cog-Rethinker: Hierarchical Metacognitive Reinforcement Learning for LLM Reasoning
by: Sun, Zexu, et al.
Published: (2025)
by: Sun, Zexu, et al.
Published: (2025)
Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors
by: Didolkar, Aniket, et al.
Published: (2025)
by: Didolkar, Aniket, et al.
Published: (2025)
Competence-Aware AI Agents with Metacognition for Unknown Situations and Environments (MUSE)
by: Valiente, Rodolfo, et al.
Published: (2024)
by: Valiente, Rodolfo, et al.
Published: (2024)
MIRROR: A Hierarchical Benchmark for Metacognitive Calibration in Large Language Models
by: Wang, Jason Z
Published: (2026)
by: Wang, Jason Z
Published: (2026)
SCI: A Metacognitive Control for Signal Dynamics
by: Meesala, Vishal Joshua
Published: (2025)
by: Meesala, Vishal Joshua
Published: (2025)
Limits of PRM-Guided Tree Search for Mathematical Reasoning with LLMs
by: Cinquin, Tristan, et al.
Published: (2025)
by: Cinquin, Tristan, et al.
Published: (2025)
Robustness is Important: Limitations of LLMs for Data Fitting
by: Liu, Hejia, et al.
Published: (2025)
by: Liu, Hejia, et al.
Published: (2025)
When the Loop Closes: Architectural Limits of In-Context Isolation, Metacognitive Co-option, and the Two-Target Design Problem in Human-LLM Systems
by: Cheng, Z., et al.
Published: (2026)
by: Cheng, Z., et al.
Published: (2026)
Comprehension Without Competence: Architectural Limits of LLMs in Symbolic Computation and Reasoning
by: Zhang, Zheng
Published: (2025)
by: Zhang, Zheng
Published: (2025)
Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks
by: Torres, Dorothy, et al.
Published: (2026)
by: Torres, Dorothy, et al.
Published: (2026)
Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models
by: Tan, Lit Sin, et al.
Published: (2026)
by: Tan, Lit Sin, et al.
Published: (2026)
A Fragile Number Sense: Probing the Elemental Limits of Numerical Reasoning in LLMs
by: Rahman, Roussel, et al.
Published: (2025)
by: Rahman, Roussel, et al.
Published: (2025)
ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning
by: Lin, Bill Yuchen, et al.
Published: (2025)
by: Lin, Bill Yuchen, et al.
Published: (2025)
State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence
by: Aviss, Thea
Published: (2025)
by: Aviss, Thea
Published: (2025)
A Practical Guide for Evaluating LLMs and LLM-Reliant Systems
by: Rudd, Ethan M., et al.
Published: (2025)
by: Rudd, Ethan M., et al.
Published: (2025)
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
by: Huang, Wei, et al.
Published: (2024)
by: Huang, Wei, et al.
Published: (2024)
The Compliance Trap: How Structural Constraints Degrade Frontier AI Metacognition Under Adversarial Pressure
by: Kumar, Rahul
Published: (2026)
by: Kumar, Rahul
Published: (2026)
Hypertokens: Holographic Associative Memory in Tokenized LLMs
by: Augeri, Christopher James
Published: (2025)
by: Augeri, Christopher James
Published: (2025)
Modality Collapse as Mismatched Decoding: Information-Theoretic Limits of Multimodal LLMs
by: Billa, Jayadev
Published: (2026)
by: Billa, Jayadev
Published: (2026)
Data Distribution as a Lever for Guiding Optimizers Toward Superior Generalization in LLMs
by: Gangavarapu, Tushaar, et al.
Published: (2026)
by: Gangavarapu, Tushaar, et al.
Published: (2026)
seqBench: A Tunable Benchmark to Quantify Sequential Reasoning Limits of LLMs
by: Ramezanali, Mohammad, et al.
Published: (2025)
by: Ramezanali, Mohammad, et al.
Published: (2025)
Dialogue Without Limits: Constant-Sized KV Caches for Extended Responses in LLMs
by: Ghadia, Ravi, et al.
Published: (2025)
by: Ghadia, Ravi, et al.
Published: (2025)
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
by: Duan, Jinhao, et al.
Published: (2024)
by: Duan, Jinhao, et al.
Published: (2024)
Compute-Optimal LLMs Provably Generalize Better With Scale
by: Finzi, Marc, et al.
Published: (2025)
by: Finzi, Marc, et al.
Published: (2025)
LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law
by: Liu, Toni J. B., et al.
Published: (2024)
by: Liu, Toni J. B., et al.
Published: (2024)
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers
by: Yin, Junjie, et al.
Published: (2023)
by: Yin, Junjie, et al.
Published: (2023)
AutoOR: Scalably Post-training LLMs to Autoformalize Operations Research Problems
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)
by: Motwani, Sumeet Ramesh, et al.
Published: (2026)
Early Evidence of Vibe-Proving with Consumer LLMs: A Case Study on Spectral Region Characterization with ChatGPT-5.2 (Thinking)
by: Verbeken, Brecht, et al.
Published: (2026)
by: Verbeken, Brecht, et al.
Published: (2026)
On Limitations of the Transformer Architecture
by: Peng, Binghui, et al.
Published: (2024)
by: Peng, Binghui, et al.
Published: (2024)
Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity
by: Guo, Wentao, et al.
Published: (2024)
by: Guo, Wentao, et al.
Published: (2024)
LLMs Judging LLMs: A Simplex Perspective
by: Vossler, Patrick, et al.
Published: (2025)
by: Vossler, Patrick, et al.
Published: (2025)
On Limitation of Transformer for Learning HMMs
by: Hu, Jiachen, et al.
Published: (2024)
by: Hu, Jiachen, et al.
Published: (2024)
ECPO: Evidence-Coupled Policy Optimization for Evidence-Certified Candidate Ranking
by: Hu, Miaobo, et al.
Published: (2026)
by: Hu, Miaobo, et al.
Published: (2026)
Similar Items
-
Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind
by: Ackerman, Christopher
Published: (2026) -
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
by: Didolkar, Aniket, et al.
Published: (2024) -
Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
by: Ackerman, Christopher, et al.
Published: (2024) -
Mitigating Many-Shot Jailbreaking
by: Ackerman, Christopher M., et al.
Published: (2025) -
Metis: Learning to Jailbreak LLMs via Self-Evolving Metacognitive Policy Optimization
by: Zhou, Huilin, et al.
Published: (2026)