Saved in:
Bibliographic Details
Main Authors: Schäfer, Georg, Seliger, Raphael, Rehrl, Jakob, Huber, Stefan, Hirlaender, Simon
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.07607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915283765035008
author Schäfer, Georg
Seliger, Raphael
Rehrl, Jakob
Huber, Stefan
Hirlaender, Simon
author_facet Schäfer, Georg
Seliger, Raphael
Rehrl, Jakob
Huber, Stefan
Hirlaender, Simon
contents Industrial automation increasingly demands energy-efficient control strategies to balance performance with environmental and cost constraints. In this work, we present a multi-objective reinforcement learning (MORL) framework for energy-efficient control of the Quanser Aero 2 testbed in its one-degree-of-freedom configuration. We design a composite reward function that simultaneously penalizes tracking error and electrical power consumption. Preliminary experiments explore the influence of varying the Energy penalty weight, alpha, on the trade-off between pitch tracking and energy savings. Our results reveal a marked performance shift for alpha values between 0.0 and 0.25, with non-Pareto optimal solutions emerging at lower alpha values, on both the simulation and the real system. We hypothesize that these effects may be attributed to artifacts introduced by the adaptive behavior of the Adam optimizer, which could bias the learning process and favor bang-bang control strategies. Future work will focus on automating alpha selection through Gaussian Process-based Pareto front modeling and transitioning the approach from simulation to real-world deployment.
format Preprint
id arxiv_https___arxiv_org_abs_2505_07607
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Multi-Objective Reinforcement Learning for Energy-Efficient Industrial Control
Schäfer, Georg
Seliger, Raphael
Rehrl, Jakob
Huber, Stefan
Hirlaender, Simon
Systems and Control
Machine Learning
Industrial automation increasingly demands energy-efficient control strategies to balance performance with environmental and cost constraints. In this work, we present a multi-objective reinforcement learning (MORL) framework for energy-efficient control of the Quanser Aero 2 testbed in its one-degree-of-freedom configuration. We design a composite reward function that simultaneously penalizes tracking error and electrical power consumption. Preliminary experiments explore the influence of varying the Energy penalty weight, alpha, on the trade-off between pitch tracking and energy savings. Our results reveal a marked performance shift for alpha values between 0.0 and 0.25, with non-Pareto optimal solutions emerging at lower alpha values, on both the simulation and the real system. We hypothesize that these effects may be attributed to artifacts introduced by the adaptive behavior of the Adam optimizer, which could bias the learning process and favor bang-bang control strategies. Future work will focus on automating alpha selection through Gaussian Process-based Pareto front modeling and transitioning the approach from simulation to real-world deployment.
title Multi-Objective Reinforcement Learning for Energy-Efficient Industrial Control
topic Systems and Control
Machine Learning
url https://arxiv.org/abs/2505.07607