Saved in:
Bibliographic Details
Main Authors: Saanum, Tankred, Dayan, Peter, Schulz, Eric
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.17835
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917824971145216
author Saanum, Tankred
Dayan, Peter
Schulz, Eric
author_facet Saanum, Tankred
Dayan, Peter
Schulz, Eric
contents To solve control problems via model-based reasoning or planning, an agent needs to know how its actions affect the state of the world. The actions an agent has at its disposal often change the state of the environment in systematic ways. However, existing techniques for world modelling do not guarantee that the effect of actions are represented in such systematic ways. We introduce the Parsimonious Latent Space Model (PLSM), a world model that regularizes the latent dynamics to make the effect of the agent's actions more predictable. Our approach minimizes the mutual information between latent states and the change that an action produces in the agent's latent state, in turn minimizing the dependence the state has on the dynamics. This makes the world model softly state-invariant. We combine PLSM with different model classes used for i) future latent state prediction, ii) planning, and iii) model-free reinforcement learning. We find that our regularization improves accuracy, generalization, and performance in downstream tasks, highlighting the importance of systematic treatment of actions in world models.
format Preprint
id arxiv_https___arxiv_org_abs_2401_17835
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Simplifying Latent Dynamics with Softly State-Invariant World Models
Saanum, Tankred
Dayan, Peter
Schulz, Eric
Machine Learning
To solve control problems via model-based reasoning or planning, an agent needs to know how its actions affect the state of the world. The actions an agent has at its disposal often change the state of the environment in systematic ways. However, existing techniques for world modelling do not guarantee that the effect of actions are represented in such systematic ways. We introduce the Parsimonious Latent Space Model (PLSM), a world model that regularizes the latent dynamics to make the effect of the agent's actions more predictable. Our approach minimizes the mutual information between latent states and the change that an action produces in the agent's latent state, in turn minimizing the dependence the state has on the dynamics. This makes the world model softly state-invariant. We combine PLSM with different model classes used for i) future latent state prediction, ii) planning, and iii) model-free reinforcement learning. We find that our regularization improves accuracy, generalization, and performance in downstream tasks, highlighting the importance of systematic treatment of actions in world models.
title Simplifying Latent Dynamics with Softly State-Invariant World Models
topic Machine Learning
url https://arxiv.org/abs/2401.17835