Saved in:
Bibliographic Details
Main Authors: Lu, Chengda, Fan, Xiaoyu, Xu, Wei
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.05741
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914537474621440
author Lu, Chengda
Fan, Xiaoyu
Xu, Wei
author_facet Lu, Chengda
Fan, Xiaoyu
Xu, Wei
contents While Large Language Models (LLMs) achieve strong performance across diverse tasks, their inference dynamics remain poorly understood because of the limited resolution of existing analysis tools. In this work, we identify an intrinsic magnification mechanism in transformer architectures: deeper layers inherently magnify the small changes of layer-wise confidence, providing a fine-grained confidence trajectory. Building on this insight, we introduce HyperLens, a high-resolution probe designed to trace confidence trajectories and quantify the cognitive effort during inference. Across LLMs and datasets, HyperLens reveals a consistent divergence in confidence trajectories that separates complex from simple tasks. We abstract this pattern into a quantitative cognitive effort metric. Our analysis reveals a fundamental principle: complex tasks consistently require higher cognitive effort. Finally, we provide a mechanistic diagnosis of a common side effect of standard Supervised Fine-Tuning (SFT): it can reduce cognitive effort and consequently degrade performance on in-domain tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2605_05741
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle HyperLens: Quantifying Cognitive Effort in LLMs with Fine-grained Confidence Trajectory
Lu, Chengda
Fan, Xiaoyu
Xu, Wei
Artificial Intelligence
While Large Language Models (LLMs) achieve strong performance across diverse tasks, their inference dynamics remain poorly understood because of the limited resolution of existing analysis tools. In this work, we identify an intrinsic magnification mechanism in transformer architectures: deeper layers inherently magnify the small changes of layer-wise confidence, providing a fine-grained confidence trajectory. Building on this insight, we introduce HyperLens, a high-resolution probe designed to trace confidence trajectories and quantify the cognitive effort during inference. Across LLMs and datasets, HyperLens reveals a consistent divergence in confidence trajectories that separates complex from simple tasks. We abstract this pattern into a quantitative cognitive effort metric. Our analysis reveals a fundamental principle: complex tasks consistently require higher cognitive effort. Finally, we provide a mechanistic diagnosis of a common side effect of standard Supervised Fine-Tuning (SFT): it can reduce cognitive effort and consequently degrade performance on in-domain tasks.
title HyperLens: Quantifying Cognitive Effort in LLMs with Fine-grained Confidence Trajectory
topic Artificial Intelligence
url https://arxiv.org/abs/2605.05741