Saved in:
Bibliographic Details
Main Authors: Lee, Jung H., Vijayan, Sujith
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.01687
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909030504464384
author Lee, Jung H.
Vijayan, Sujith
author_facet Lee, Jung H.
Vijayan, Sujith
contents Large language models (LLMs) were invented for natural language tasks such as translation, but they have proved that they can perform highly complex functions across domains. Additionally, they have been thought to develop new skills without being trained on them. These learning capabilities lead to LLMs adoption in a wide range of domains. Thus, it is imperative that we understand their operating mechanisms and limitations for proper diagnostics and repair. The earlier studies proposed that high level concepts are encoded as linear directions in LLMs activation space and that the geometry of embeddings have semantic meanings. Inspired by these studies, we hypothesize that LLMs may use subspaces and vector algebra in subspaces to perform tasks. To address this hypothesis, we analyze LLMs' functional modules and residual streams collected from LLMs engaging in in-context learning (ICL), one of the emergent abilities. Our analyses suggest that 1) LLMs can create subspaces, where evidence can be accumulated and 2) ICL tasks can be solved via simple algebraic operations in subspaces.
format Preprint
id arxiv_https___arxiv_org_abs_2602_01687
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Functional Subspace, where language models can use vector algebra to solve problems
Lee, Jung H.
Vijayan, Sujith
Computation and Language
Artificial Intelligence
Large language models (LLMs) were invented for natural language tasks such as translation, but they have proved that they can perform highly complex functions across domains. Additionally, they have been thought to develop new skills without being trained on them. These learning capabilities lead to LLMs adoption in a wide range of domains. Thus, it is imperative that we understand their operating mechanisms and limitations for proper diagnostics and repair. The earlier studies proposed that high level concepts are encoded as linear directions in LLMs activation space and that the geometry of embeddings have semantic meanings. Inspired by these studies, we hypothesize that LLMs may use subspaces and vector algebra in subspaces to perform tasks. To address this hypothesis, we analyze LLMs' functional modules and residual streams collected from LLMs engaging in in-context learning (ICL), one of the emergent abilities. Our analyses suggest that 1) LLMs can create subspaces, where evidence can be accumulated and 2) ICL tasks can be solved via simple algebraic operations in subspaces.
title Functional Subspace, where language models can use vector algebra to solve problems
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2602.01687