Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lu, Chengda, Fan, Xiaoyu, Xu, Wei
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.05741
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914537474621440
author	Lu, Chengda Fan, Xiaoyu Xu, Wei
author_facet	Lu, Chengda Fan, Xiaoyu Xu, Wei
contents	While Large Language Models (LLMs) achieve strong performance across diverse tasks, their inference dynamics remain poorly understood because of the limited resolution of existing analysis tools. In this work, we identify an intrinsic magnification mechanism in transformer architectures: deeper layers inherently magnify the small changes of layer-wise confidence, providing a fine-grained confidence trajectory. Building on this insight, we introduce HyperLens, a high-resolution probe designed to trace confidence trajectories and quantify the cognitive effort during inference. Across LLMs and datasets, HyperLens reveals a consistent divergence in confidence trajectories that separates complex from simple tasks. We abstract this pattern into a quantitative cognitive effort metric. Our analysis reveals a fundamental principle: complex tasks consistently require higher cognitive effort. Finally, we provide a mechanistic diagnosis of a common side effect of standard Supervised Fine-Tuning (SFT): it can reduce cognitive effort and consequently degrade performance on in-domain tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_05741
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	HyperLens: Quantifying Cognitive Effort in LLMs with Fine-grained Confidence Trajectory Lu, Chengda Fan, Xiaoyu Xu, Wei Artificial Intelligence While Large Language Models (LLMs) achieve strong performance across diverse tasks, their inference dynamics remain poorly understood because of the limited resolution of existing analysis tools. In this work, we identify an intrinsic magnification mechanism in transformer architectures: deeper layers inherently magnify the small changes of layer-wise confidence, providing a fine-grained confidence trajectory. Building on this insight, we introduce HyperLens, a high-resolution probe designed to trace confidence trajectories and quantify the cognitive effort during inference. Across LLMs and datasets, HyperLens reveals a consistent divergence in confidence trajectories that separates complex from simple tasks. We abstract this pattern into a quantitative cognitive effort metric. Our analysis reveals a fundamental principle: complex tasks consistently require higher cognitive effort. Finally, we provide a mechanistic diagnosis of a common side effect of standard Supervised Fine-Tuning (SFT): it can reduce cognitive effort and consequently degrade performance on in-domain tasks.
title	HyperLens: Quantifying Cognitive Effort in LLMs with Fine-grained Confidence Trajectory
topic	Artificial Intelligence
url	https://arxiv.org/abs/2605.05741

Similar Items