Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ju, Zhaoxun, Yang, Chao, Wang, Hongbo, Qiao, Yu, Sun, Fuchun
Format:	Preprint
Published:	2024
Subjects:	Robotics Artificial Intelligence I.2.6
Online Access:	https://arxiv.org/abs/2402.17511
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913245443391488
author	Ju, Zhaoxun Yang, Chao Wang, Hongbo Qiao, Yu Sun, Fuchun
author_facet	Ju, Zhaoxun Yang, Chao Wang, Hongbo Qiao, Yu Sun, Fuchun
contents	Language-conditioned robot behavior plays a vital role in executing complex tasks by associating human commands or instructions with perception and actions. The ability to compose long-horizon tasks based on unconstrained language instructions necessitates the acquisition of a diverse set of general-purpose skills. However, acquiring inherent primitive skills in a coupled and long-horizon environment without external rewards or human supervision presents significant challenges. In this paper, we evaluate the relationship between skills and language instructions from a mathematical perspective, employing two forms of mutual information within the framework of language-conditioned policy learning. To maximize the mutual information between language and skills in an unsupervised manner, we propose an end-to-end imitation learning approach known as Language Conditioned Skill Discovery (LCSD). Specifically, we utilize vector quantization to learn discrete latent skills and leverage skill sequences of trajectories to reconstruct high-level semantic instructions. Through extensive experiments on language-conditioned robotic navigation and manipulation tasks, encompassing BabyAI, LORel, and CALVIN, we demonstrate the superiority of our method over prior works. Our approach exhibits enhanced generalization capabilities towards unseen tasks, improved skill interpretability, and notably higher rates of task completion success.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_17511
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning Ju, Zhaoxun Yang, Chao Wang, Hongbo Qiao, Yu Sun, Fuchun Robotics Artificial Intelligence I.2.6 Language-conditioned robot behavior plays a vital role in executing complex tasks by associating human commands or instructions with perception and actions. The ability to compose long-horizon tasks based on unconstrained language instructions necessitates the acquisition of a diverse set of general-purpose skills. However, acquiring inherent primitive skills in a coupled and long-horizon environment without external rewards or human supervision presents significant challenges. In this paper, we evaluate the relationship between skills and language instructions from a mathematical perspective, employing two forms of mutual information within the framework of language-conditioned policy learning. To maximize the mutual information between language and skills in an unsupervised manner, we propose an end-to-end imitation learning approach known as Language Conditioned Skill Discovery (LCSD). Specifically, we utilize vector quantization to learn discrete latent skills and leverage skill sequences of trajectories to reconstruct high-level semantic instructions. Through extensive experiments on language-conditioned robotic navigation and manipulation tasks, encompassing BabyAI, LORel, and CALVIN, we demonstrate the superiority of our method over prior works. Our approach exhibits enhanced generalization capabilities towards unseen tasks, improved skill interpretability, and notably higher rates of task completion success.
title	Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning
topic	Robotics Artificial Intelligence I.2.6
url	https://arxiv.org/abs/2402.17511

Similar Items