Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Mengyu, Zhi, Xiaoying, Li, Zhiyi, Schmucker, Robin, Cohen, Shay B., Ma, Tiejun, Silavong, Fran
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence Computer Vision and Pattern Recognition Information Retrieval
Online Access:	https://arxiv.org/abs/2604.22939
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

While the next-token prediction (NTP) paradigm enables large language models (LLMs) to express their intrinsic knowledge, its sequential nature constrains performance on specialized, non-generative tasks. We attribute this performance bottleneck to the LLMs' knowledge expression mechanism, rather than to deficiencies in knowledge acquisition. To address this, we propose Self-Knowledge Re-expression (SKR), a novel, task-agnostic adaptation method. SKR transforms the LLM's output from generic token generation to highly efficient, task-specific expression. SKR is a fully local method that uses only unannotated data, requiring neither human supervision nor model distillation. Experiments on a large financial document dataset demonstrate substantial improvements: over 40% in Recall@1 for information retrieval tasks, over 76% reduction in object detection latency, and over 33% increase in anomaly detection AUPRC. Our results on the MMDocRAG dataset surpass those of leading retrieval models by at least 12.6%.

Similar Items