Saved in:
Bibliographic Details
Main Authors: Correia, André, Alexandre, Luís A.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2405.07943
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913549589151744
author Correia, André
Alexandre, Luís A.
author_facet Correia, André
Alexandre, Luís A.
contents Recent advancements in imitation learning have been largely fueled by the integration of sequence models, which provide a structured flow of information to effectively mimic task behaviours. Currently, Decision Transformer (DT) and subsequently, the Hierarchical Decision Transformer (HDT), presented Transformer-based approaches to learn task policies. Recently, the Mamba architecture has shown to outperform Transformers across various task domains. In this work, we introduce two novel methods, Decision Mamba (DM) and Hierarchical Decision Mamba (HDM), aimed at enhancing the performance of the Transformer models. Through extensive experimentation across diverse environments such as OpenAI Gym and D4RL, leveraging varying demonstration data sets, we demonstrate the superiority of Mamba models over their Transformer counterparts in a majority of tasks. Results show that DM outperforms other methods in most settings. The code can be found at https://github.com/meowatthemoon/DecisionMamba.
format Preprint
id arxiv_https___arxiv_org_abs_2405_07943
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Decision Mamba Architectures
Correia, André
Alexandre, Luís A.
Machine Learning
Artificial Intelligence
Recent advancements in imitation learning have been largely fueled by the integration of sequence models, which provide a structured flow of information to effectively mimic task behaviours. Currently, Decision Transformer (DT) and subsequently, the Hierarchical Decision Transformer (HDT), presented Transformer-based approaches to learn task policies. Recently, the Mamba architecture has shown to outperform Transformers across various task domains. In this work, we introduce two novel methods, Decision Mamba (DM) and Hierarchical Decision Mamba (HDM), aimed at enhancing the performance of the Transformer models. Through extensive experimentation across diverse environments such as OpenAI Gym and D4RL, leveraging varying demonstration data sets, we demonstrate the superiority of Mamba models over their Transformer counterparts in a majority of tasks. Results show that DM outperforms other methods in most settings. The code can be found at https://github.com/meowatthemoon/DecisionMamba.
title Decision Mamba Architectures
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2405.07943