Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Timoneda, Joan C., Vera, Sebastián Vallejo
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2503.04874
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909528739545088
author	Timoneda, Joan C. Vera, Sebastián Vallejo
author_facet	Timoneda, Joan C. Vera, Sebastián Vallejo
contents	Generative Large Language Models (LLMs) have shown promising results in text annotation using zero-shot and few-shot learning. Yet these approaches do not allow the model to retain information from previous annotations, making each response independent from the preceding ones. This raises the question of whether model memory -- the LLM having knowledge about its own previous annotations in the same task -- affects performance. In this article, using OpenAI's GPT-4o and Meta's Llama 3.1 on two political science datasets, we demonstrate that allowing the model to retain information about its own previous classifications yields significant performance improvements: between 5 and 25\% when compared to zero-shot and few-shot learning. Moreover, memory reinforcement, a novel approach we propose that combines model memory and reinforcement learning, yields additional performance gains in three out of our four tests. These findings have important implications for applied researchers looking to improve performance and efficiency in LLM annotation tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_04874
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Memory Is All You Need: Testing How Model Memory Affects LLM Performance in Annotation Tasks Timoneda, Joan C. Vera, Sebastián Vallejo Computation and Language Machine Learning Generative Large Language Models (LLMs) have shown promising results in text annotation using zero-shot and few-shot learning. Yet these approaches do not allow the model to retain information from previous annotations, making each response independent from the preceding ones. This raises the question of whether model memory -- the LLM having knowledge about its own previous annotations in the same task -- affects performance. In this article, using OpenAI's GPT-4o and Meta's Llama 3.1 on two political science datasets, we demonstrate that allowing the model to retain information about its own previous classifications yields significant performance improvements: between 5 and 25\% when compared to zero-shot and few-shot learning. Moreover, memory reinforcement, a novel approach we propose that combines model memory and reinforcement learning, yields additional performance gains in three out of our four tests. These findings have important implications for applied researchers looking to improve performance and efficiency in LLM annotation tasks.
title	Memory Is All You Need: Testing How Model Memory Affects LLM Performance in Annotation Tasks
topic	Computation and Language Machine Learning
url	https://arxiv.org/abs/2503.04874

Similar Items