Saved in:
Bibliographic Details
Main Authors: Timoneda, Joan C., Vera, Sebastián Vallejo
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.04874
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909528739545088
author Timoneda, Joan C.
Vera, Sebastián Vallejo
author_facet Timoneda, Joan C.
Vera, Sebastián Vallejo
contents Generative Large Language Models (LLMs) have shown promising results in text annotation using zero-shot and few-shot learning. Yet these approaches do not allow the model to retain information from previous annotations, making each response independent from the preceding ones. This raises the question of whether model memory -- the LLM having knowledge about its own previous annotations in the same task -- affects performance. In this article, using OpenAI's GPT-4o and Meta's Llama 3.1 on two political science datasets, we demonstrate that allowing the model to retain information about its own previous classifications yields significant performance improvements: between 5 and 25\% when compared to zero-shot and few-shot learning. Moreover, memory reinforcement, a novel approach we propose that combines model memory and reinforcement learning, yields additional performance gains in three out of our four tests. These findings have important implications for applied researchers looking to improve performance and efficiency in LLM annotation tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2503_04874
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Memory Is All You Need: Testing How Model Memory Affects LLM Performance in Annotation Tasks
Timoneda, Joan C.
Vera, Sebastián Vallejo
Computation and Language
Machine Learning
Generative Large Language Models (LLMs) have shown promising results in text annotation using zero-shot and few-shot learning. Yet these approaches do not allow the model to retain information from previous annotations, making each response independent from the preceding ones. This raises the question of whether model memory -- the LLM having knowledge about its own previous annotations in the same task -- affects performance. In this article, using OpenAI's GPT-4o and Meta's Llama 3.1 on two political science datasets, we demonstrate that allowing the model to retain information about its own previous classifications yields significant performance improvements: between 5 and 25\% when compared to zero-shot and few-shot learning. Moreover, memory reinforcement, a novel approach we propose that combines model memory and reinforcement learning, yields additional performance gains in three out of our four tests. These findings have important implications for applied researchers looking to improve performance and efficiency in LLM annotation tasks.
title Memory Is All You Need: Testing How Model Memory Affects LLM Performance in Annotation Tasks
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2503.04874