Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ramadan, Mohammad S., Al-Tawaha, Ahmad, Shouman, Mohamed, Atallah, Ahmed, Jin, Ming
Format:	Preprint
Published:	2023
Subjects:	Systems and Control
Online Access:	https://arxiv.org/abs/2303.06200
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914942007902208
author	Ramadan, Mohammad S. Al-Tawaha, Ahmad Shouman, Mohamed Atallah, Ahmed Jin, Ming
author_facet	Ramadan, Mohammad S. Al-Tawaha, Ahmad Shouman, Mohamed Atallah, Ahmed Jin, Ming
contents	Dynamic Programming (DP) suffers from the well-known ``curse of dimensionality'', further exacerbated by the need to compute expectations over process noise in stochastic models. This paper presents a Monte Carlo-based sampling approach for the state space and an interpolation procedure for the resulting value function, dependent on the process noise density, in a "self-approximating" fashion, eliminating the need for ordering or set-membership tests. We provide proof of almost sure convergence for the value iteration (and consequently, policy iteration) procedure. The proposed meshless sampling and interpolation algorithm alleviates the burden of gridding the state space, traditionally required in DP, and avoids constructing a piecewise constant value function over a grid. Moreover, we demonstrate that the proposed interpolation procedure is well-suited for handling probabilistic constraints by sampling both infeasible and feasible regions. The curse of dimensionality cannot be avoided, however, this approach offers a practical framework for addressing lower-order stochastic nonlinear systems with probabilistic constraints, while eliminating the need for linear interpolations and set membership tests. Numerical examples are presented to further explain and illustrate the convenience of the proposed algorithms.
format	Preprint
id	arxiv_https___arxiv_org_abs_2303_06200
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Monte Carlo Grid Dynamic Programming: Almost Sure Convergence and Probability Constraints Ramadan, Mohammad S. Al-Tawaha, Ahmad Shouman, Mohamed Atallah, Ahmed Jin, Ming Systems and Control Dynamic Programming (DP) suffers from the well-known ``curse of dimensionality'', further exacerbated by the need to compute expectations over process noise in stochastic models. This paper presents a Monte Carlo-based sampling approach for the state space and an interpolation procedure for the resulting value function, dependent on the process noise density, in a "self-approximating" fashion, eliminating the need for ordering or set-membership tests. We provide proof of almost sure convergence for the value iteration (and consequently, policy iteration) procedure. The proposed meshless sampling and interpolation algorithm alleviates the burden of gridding the state space, traditionally required in DP, and avoids constructing a piecewise constant value function over a grid. Moreover, we demonstrate that the proposed interpolation procedure is well-suited for handling probabilistic constraints by sampling both infeasible and feasible regions. The curse of dimensionality cannot be avoided, however, this approach offers a practical framework for addressing lower-order stochastic nonlinear systems with probabilistic constraints, while eliminating the need for linear interpolations and set membership tests. Numerical examples are presented to further explain and illustrate the convenience of the proposed algorithms.
title	Monte Carlo Grid Dynamic Programming: Almost Sure Convergence and Probability Constraints
topic	Systems and Control
url	https://arxiv.org/abs/2303.06200

Similar Items