Saved in:
Bibliographic Details
Main Author: Ikemoto, Junya
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.20152
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910215069237248
author Ikemoto, Junya
author_facet Ikemoto, Junya
contents This paper proposes a simulation-based reinforcement learning algorithm for controlling systems with uncertain and varying system parameters. While simulators are useful for safely learning control policies, the reality gap remains a major challenge. To alleviate this challenge, we propose a two-stage algorithm. First, multiple control policies are learned for systems with different system parameters in a simulator. Second, for a real system, the control policies are adaptively switched using an online convex optimization algorithm based on observations. This approach is expected to reduce learning complexity compared with existing approaches that rely on a single policy to address the reality gap.
format Preprint
id arxiv_https___arxiv_org_abs_2510_20152
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Soft Switching Expert Policies for Controlling Systems with Uncertain Parameters
Ikemoto, Junya
Systems and Control
This paper proposes a simulation-based reinforcement learning algorithm for controlling systems with uncertain and varying system parameters. While simulators are useful for safely learning control policies, the reality gap remains a major challenge. To alleviate this challenge, we propose a two-stage algorithm. First, multiple control policies are learned for systems with different system parameters in a simulator. Second, for a real system, the control policies are adaptively switched using an online convex optimization algorithm based on observations. This approach is expected to reduce learning complexity compared with existing approaches that rely on a single policy to address the reality gap.
title Soft Switching Expert Policies for Controlling Systems with Uncertain Parameters
topic Systems and Control
url https://arxiv.org/abs/2510.20152