Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jia, Xiaolong, Bajaj, Nikhil
Format:	Preprint
Published:	2025
Subjects:	Systems and Control
Online Access:	https://arxiv.org/abs/2510.03354
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909823627427840
author	Jia, Xiaolong Bajaj, Nikhil
author_facet	Jia, Xiaolong Bajaj, Nikhil
contents	Model Predictive Control (MPC) faces computational demands and performance degradation from model inaccuracies. We propose two architectures combining Neural Network-approximated MPC (NNMPC) with Reinforcement Learning (RL). The first, Warm Start RL, initializes the RL actor with pre-trained NNMPC weights. The second, RLMPC, uses RL to generate corrective residuals for NNMPC outputs. We introduce a downsampling method reducing NNMPC input dimensions while maintaining performance. Evaluated on a rotary inverted pendulum, both architectures demonstrate runtime reductions exceeding 99% compared to traditional MPC while improving tracking performance under model uncertainties, with RL+MPC achieving 11-40% cost reduction depending on reference amplitude.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_03354
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	On Architectures for Combining Reinforcement Learning and Model Predictive Control with Runtime Improvements Jia, Xiaolong Bajaj, Nikhil Systems and Control Model Predictive Control (MPC) faces computational demands and performance degradation from model inaccuracies. We propose two architectures combining Neural Network-approximated MPC (NNMPC) with Reinforcement Learning (RL). The first, Warm Start RL, initializes the RL actor with pre-trained NNMPC weights. The second, RLMPC, uses RL to generate corrective residuals for NNMPC outputs. We introduce a downsampling method reducing NNMPC input dimensions while maintaining performance. Evaluated on a rotary inverted pendulum, both architectures demonstrate runtime reductions exceeding 99% compared to traditional MPC while improving tracking performance under model uncertainties, with RL+MPC achieving 11-40% cost reduction depending on reference amplitude.
title	On Architectures for Combining Reinforcement Learning and Model Predictive Control with Runtime Improvements
topic	Systems and Control
url	https://arxiv.org/abs/2510.03354

Similar Items