Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lee, Jung H., Vijayan, Sujith
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.01687
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909030504464384
author	Lee, Jung H. Vijayan, Sujith
author_facet	Lee, Jung H. Vijayan, Sujith
contents	Large language models (LLMs) were invented for natural language tasks such as translation, but they have proved that they can perform highly complex functions across domains. Additionally, they have been thought to develop new skills without being trained on them. These learning capabilities lead to LLMs adoption in a wide range of domains. Thus, it is imperative that we understand their operating mechanisms and limitations for proper diagnostics and repair. The earlier studies proposed that high level concepts are encoded as linear directions in LLMs activation space and that the geometry of embeddings have semantic meanings. Inspired by these studies, we hypothesize that LLMs may use subspaces and vector algebra in subspaces to perform tasks. To address this hypothesis, we analyze LLMs' functional modules and residual streams collected from LLMs engaging in in-context learning (ICL), one of the emergent abilities. Our analyses suggest that 1) LLMs can create subspaces, where evidence can be accumulated and 2) ICL tasks can be solved via simple algebraic operations in subspaces.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_01687
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Functional Subspace, where language models can use vector algebra to solve problems Lee, Jung H. Vijayan, Sujith Computation and Language Artificial Intelligence Large language models (LLMs) were invented for natural language tasks such as translation, but they have proved that they can perform highly complex functions across domains. Additionally, they have been thought to develop new skills without being trained on them. These learning capabilities lead to LLMs adoption in a wide range of domains. Thus, it is imperative that we understand their operating mechanisms and limitations for proper diagnostics and repair. The earlier studies proposed that high level concepts are encoded as linear directions in LLMs activation space and that the geometry of embeddings have semantic meanings. Inspired by these studies, we hypothesize that LLMs may use subspaces and vector algebra in subspaces to perform tasks. To address this hypothesis, we analyze LLMs' functional modules and residual streams collected from LLMs engaging in in-context learning (ICL), one of the emergent abilities. Our analyses suggest that 1) LLMs can create subspaces, where evidence can be accumulated and 2) ICL tasks can be solved via simple algebraic operations in subspaces.
title	Functional Subspace, where language models can use vector algebra to solve problems
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2602.01687

Similar Items