Saved in:
Bibliographic Details
Main Authors: Jia, Xiaolong, Bajaj, Nikhil
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.03354
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909823627427840
author Jia, Xiaolong
Bajaj, Nikhil
author_facet Jia, Xiaolong
Bajaj, Nikhil
contents Model Predictive Control (MPC) faces computational demands and performance degradation from model inaccuracies. We propose two architectures combining Neural Network-approximated MPC (NNMPC) with Reinforcement Learning (RL). The first, Warm Start RL, initializes the RL actor with pre-trained NNMPC weights. The second, RLMPC, uses RL to generate corrective residuals for NNMPC outputs. We introduce a downsampling method reducing NNMPC input dimensions while maintaining performance. Evaluated on a rotary inverted pendulum, both architectures demonstrate runtime reductions exceeding 99% compared to traditional MPC while improving tracking performance under model uncertainties, with RL+MPC achieving 11-40% cost reduction depending on reference amplitude.
format Preprint
id arxiv_https___arxiv_org_abs_2510_03354
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle On Architectures for Combining Reinforcement Learning and Model Predictive Control with Runtime Improvements
Jia, Xiaolong
Bajaj, Nikhil
Systems and Control
Model Predictive Control (MPC) faces computational demands and performance degradation from model inaccuracies. We propose two architectures combining Neural Network-approximated MPC (NNMPC) with Reinforcement Learning (RL). The first, Warm Start RL, initializes the RL actor with pre-trained NNMPC weights. The second, RLMPC, uses RL to generate corrective residuals for NNMPC outputs. We introduce a downsampling method reducing NNMPC input dimensions while maintaining performance. Evaluated on a rotary inverted pendulum, both architectures demonstrate runtime reductions exceeding 99% compared to traditional MPC while improving tracking performance under model uncertainties, with RL+MPC achieving 11-40% cost reduction depending on reference amplitude.
title On Architectures for Combining Reinforcement Learning and Model Predictive Control with Runtime Improvements
topic Systems and Control
url https://arxiv.org/abs/2510.03354