Saved in:
Bibliographic Details
Main Authors: Zeng, Lingjia, Li, Manman
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.15167
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917151294619648
author Zeng, Lingjia
Li, Manman
author_facet Zeng, Lingjia
Li, Manman
contents This study considers an optimal reinsurance, investment, and dividend strategy control problem for insurance companies in a regulated Markov regime-switching environment, intending to maximize long-run average reward. Unlike existing single or dual strategy studies, an integrated control framework is established under solvency constraints, allowing investment and dividends only when the surplus process exceeds a minimum cash requirement level. To address the analytical difficulties associated with solving HJB equations and stationary distributions in high-dimensional state spaces under regime switching, we construct a numerical approximation scheme for the optimal strategy function based on Markov chains and neural networks. Furthermore, we establish the convergence of the corresponding sequence of surplus processes and rigorously prove that the associated optimal values converge to the true value function. Finally, we provide a numerical example based on the approximate dynamic programming method to demonstrate the feasibility of the proposed method.
format Preprint
id arxiv_https___arxiv_org_abs_2512_15167
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Long-Run Average Reward Maximization of A Regulated Regime-Switching Diffusion Model
Zeng, Lingjia
Li, Manman
Optimization and Control
60J20, 49M25, 90C40
This study considers an optimal reinsurance, investment, and dividend strategy control problem for insurance companies in a regulated Markov regime-switching environment, intending to maximize long-run average reward. Unlike existing single or dual strategy studies, an integrated control framework is established under solvency constraints, allowing investment and dividends only when the surplus process exceeds a minimum cash requirement level. To address the analytical difficulties associated with solving HJB equations and stationary distributions in high-dimensional state spaces under regime switching, we construct a numerical approximation scheme for the optimal strategy function based on Markov chains and neural networks. Furthermore, we establish the convergence of the corresponding sequence of surplus processes and rigorously prove that the associated optimal values converge to the true value function. Finally, we provide a numerical example based on the approximate dynamic programming method to demonstrate the feasibility of the proposed method.
title Long-Run Average Reward Maximization of A Regulated Regime-Switching Diffusion Model
topic Optimization and Control
60J20, 49M25, 90C40
url https://arxiv.org/abs/2512.15167