Saved in:
Bibliographic Details
Main Authors: Unold, Olgierd, Franczyk, Stanisław
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.09400
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908765010264064
author Unold, Olgierd
Franczyk, Stanisław
author_facet Unold, Olgierd
Franczyk, Stanisław
contents This paper introduces ACS2HER, a novel integration of the Anticipatory Classifier System (ACS2) with the Hindsight Experience Replay (HER) mechanism. While ACS2 is highly effective at building cognitive maps through latent learning, its performance often stagnates in environments characterized by sparse rewards. We propose a specific architectural variant that triggers hindsight learning when the agent fails to reach its primary goal, re-labeling visited states as virtual goals to densify the learning signal. The proposed model was evaluated on two benchmarks: the deterministic \texttt{Maze 6} and the stochastic \texttt{FrozenLake}. The results demonstrate that ACS2HER significantly accelerates knowledge acquisition and environmental mastery compared to the standard ACS2. However, this efficiency gain is accompanied by increased computational overhead and a substantial expansion in classifier numerosity. This work provides the first analysis of combining anticipatory mechanisms with retrospective goal-relabeling in Learning Classifier Systems.
format Preprint
id arxiv_https___arxiv_org_abs_2601_09400
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Preliminary Tests of the Anticipatory Classifier System with Hindsight Experience Replay
Unold, Olgierd
Franczyk, Stanisław
Machine Learning
This paper introduces ACS2HER, a novel integration of the Anticipatory Classifier System (ACS2) with the Hindsight Experience Replay (HER) mechanism. While ACS2 is highly effective at building cognitive maps through latent learning, its performance often stagnates in environments characterized by sparse rewards. We propose a specific architectural variant that triggers hindsight learning when the agent fails to reach its primary goal, re-labeling visited states as virtual goals to densify the learning signal. The proposed model was evaluated on two benchmarks: the deterministic \texttt{Maze 6} and the stochastic \texttt{FrozenLake}. The results demonstrate that ACS2HER significantly accelerates knowledge acquisition and environmental mastery compared to the standard ACS2. However, this efficiency gain is accompanied by increased computational overhead and a substantial expansion in classifier numerosity. This work provides the first analysis of combining anticipatory mechanisms with retrospective goal-relabeling in Learning Classifier Systems.
title Preliminary Tests of the Anticipatory Classifier System with Hindsight Experience Replay
topic Machine Learning
url https://arxiv.org/abs/2601.09400