Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Watahiki, Hayato, Iwase, Ryo, Unno, Ryosuke, Tsuruoka, Yoshimasa
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2407.16912
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909266725568512
author	Watahiki, Hayato Iwase, Ryo Unno, Ryosuke Tsuruoka, Yoshimasa
author_facet	Watahiki, Hayato Iwase, Ryo Unno, Ryosuke Tsuruoka, Yoshimasa
contents	Transferring learned skills across diverse situations remains a fundamental challenge for autonomous agents, particularly when agents are not allowed to interact with an exact target setup. While prior approaches have predominantly focused on learning domain translation, they often struggle with handling significant domain gaps or out-of-distribution tasks. In this paper, we present a simple approach for cross-domain policy transfer that learns a shared latent representation across domains and a common abstract policy on top of it. Our approach leverages multi-domain behavioral cloning on unaligned trajectories of proxy tasks and employs maximum mean discrepancy (MMD) as a regularization term to encourage cross-domain alignment. The MMD regularization better preserves structures of latent state distributions than commonly used domain-discriminative distribution matching, leading to higher transfer performance. Moreover, our approach involves training only one multi-domain policy, which makes extension easier than existing methods. Empirical evaluations demonstrate the efficacy of our method across various domain shifts, especially in scenarios where exact domain translation is challenging, such as cross-morphology or cross-viewpoint settings. Our ablation studies further reveal that multi-domain behavioral cloning implicitly contributes to representation alignment alongside domain-adversarial regularization.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_16912
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Cross-Domain Policy Transfer by Representation Alignment via Multi-Domain Behavioral Cloning Watahiki, Hayato Iwase, Ryo Unno, Ryosuke Tsuruoka, Yoshimasa Machine Learning Transferring learned skills across diverse situations remains a fundamental challenge for autonomous agents, particularly when agents are not allowed to interact with an exact target setup. While prior approaches have predominantly focused on learning domain translation, they often struggle with handling significant domain gaps or out-of-distribution tasks. In this paper, we present a simple approach for cross-domain policy transfer that learns a shared latent representation across domains and a common abstract policy on top of it. Our approach leverages multi-domain behavioral cloning on unaligned trajectories of proxy tasks and employs maximum mean discrepancy (MMD) as a regularization term to encourage cross-domain alignment. The MMD regularization better preserves structures of latent state distributions than commonly used domain-discriminative distribution matching, leading to higher transfer performance. Moreover, our approach involves training only one multi-domain policy, which makes extension easier than existing methods. Empirical evaluations demonstrate the efficacy of our method across various domain shifts, especially in scenarios where exact domain translation is challenging, such as cross-morphology or cross-viewpoint settings. Our ablation studies further reveal that multi-domain behavioral cloning implicitly contributes to representation alignment alongside domain-adversarial regularization.
title	Cross-Domain Policy Transfer by Representation Alignment via Multi-Domain Behavioral Cloning
topic	Machine Learning
url	https://arxiv.org/abs/2407.16912

Similar Items