Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Zeyu, Li, Yao-Hui, Li, Xin, Zang, Hongyu, Laroche, Romain, Islam, Riashat
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.01316
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908550656163840
author	Wang, Zeyu Li, Yao-Hui Li, Xin Zang, Hongyu Laroche, Romain Islam, Riashat
author_facet	Wang, Zeyu Li, Yao-Hui Li, Xin Zang, Hongyu Laroche, Romain Islam, Riashat
contents	Multi-View Reinforcement Learning (MVRL) seeks to provide agents with multi-view observations, enabling them to perceive environment with greater effectiveness and precision. Recent advancements in MVRL focus on extracting latent representations from multiview observations and leveraging them in control tasks. However, it is not straightforward to learn compact and task-relevant representations, particularly in the presence of redundancy, distracting information, or missing views. In this paper, we propose Multi-view Fusion State for Control (MFSC), firstly incorporating bisimulation metric learning into MVRL to learn task-relevant representations. Furthermore, we propose a multiview-based mask and latent reconstruction auxiliary task that exploits shared information across views and improves MFSC's robustness in missing views by introducing a mask token. Extensive experimental results demonstrate that our method outperforms existing approaches in MVRL tasks. Even in more realistic scenarios with interference or missing views, MFSC consistently maintains high performance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_01316
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Learning Fused State Representations for Control from Multi-View Observations Wang, Zeyu Li, Yao-Hui Li, Xin Zang, Hongyu Laroche, Romain Islam, Riashat Machine Learning Artificial Intelligence Multi-View Reinforcement Learning (MVRL) seeks to provide agents with multi-view observations, enabling them to perceive environment with greater effectiveness and precision. Recent advancements in MVRL focus on extracting latent representations from multiview observations and leveraging them in control tasks. However, it is not straightforward to learn compact and task-relevant representations, particularly in the presence of redundancy, distracting information, or missing views. In this paper, we propose Multi-view Fusion State for Control (MFSC), firstly incorporating bisimulation metric learning into MVRL to learn task-relevant representations. Furthermore, we propose a multiview-based mask and latent reconstruction auxiliary task that exploits shared information across views and improves MFSC's robustness in missing views by introducing a mask token. Extensive experimental results demonstrate that our method outperforms existing approaches in MVRL tasks. Even in more realistic scenarios with interference or missing views, MFSC consistently maintains high performance.
title	Learning Fused State Representations for Control from Multi-View Observations
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2502.01316

Similar Items