Staff View: Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search

Bibliographic Details
Main Authors:	Bogoclu, Can; Vosshall, Robert; Cremanns, Kevin; Roos, Dirk
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2403.15908

_version_	1866917621167816704
author	Bogoclu, Can; Vosshall, Robert; Cremanns, Kevin; Roos, Dirk
author_facet	Bogoclu, Can; Vosshall, Robert; Cremanns, Kevin; Roos, Dirk
contents	Probabilistic world models increase the data efficiency of model-based reinforcement learning (MBRL) by guiding the policy with their epistemic uncertainty to improve exploration and acquire new samples. Moreover, the uncertainty-aware learning procedures in probabilistic approaches lead to robust policies that are less sensitive to noisy observations than uncertainty-unaware solutions. We propose to combine trajectory sampling and a deep Gaussian covariance network (DGCN) for a data-efficient solution to MBRL problems in an optimal control setting. We compare trajectory sampling with density-based approximation for uncertainty propagation using three different probabilistic world models: Gaussian processes, Bayesian neural networks, and DGCNs. We provide empirical evidence, using four different well-known test environments, that our method improves the sample efficiency over other combinations of uncertainty propagation methods and probabilistic models. During our tests, we place particular emphasis on the robustness of the learned policies with respect to noisy initial states.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_15908
institution	arXiv
publishDate	2024
record_format	arxiv
title	Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search
topic	Machine Learning
url	https://arxiv.org/abs/2403.15908
