Saved in:
Bibliographic Details
Main Authors: Zakaria, Khabbab, Jerinsh, Jayapaulraj, Maier, Andreas, Krauss, Patrick, Pasquali, Stefano, Mehta, Dhagash
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.04896
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911365461966848
author Zakaria, Khabbab
Jerinsh, Jayapaulraj
Maier, Andreas
Krauss, Patrick
Pasquali, Stefano
Mehta, Dhagash
author_facet Zakaria, Khabbab
Jerinsh, Jayapaulraj
Maier, Andreas
Krauss, Patrick
Pasquali, Stefano
Mehta, Dhagash
contents Optimal Order Execution is a well-established problem in finance that pertains to the flawless execution of a trade (buy or sell) for a given volume within a specified time frame. This problem revolves around optimizing returns while minimizing risk, yet recent research predominantly focuses on addressing one aspect of this challenge. In this paper, we introduce an innovative approach to Optimal Order Execution within the US market, leveraging Deep Reinforcement Learning (DRL) to effectively address this optimization problem holistically. Our study assesses the performance of our model in comparison to two widely employed execution strategies: Volume Weighted Average Price (VWAP) and Time Weighted Average Price (TWAP). Our experimental findings clearly demonstrate that our DRL-based approach outperforms both VWAP and TWAP in terms of return on investment and risk management. The model's ability to adapt dynamically to market conditions, even during periods of market stress, underscores its promise as a robust solution.
format Preprint
id arxiv_https___arxiv_org_abs_2601_04896
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Deep Reinforcement Learning for Optimum Order Execution: Mitigating Risk and Maximizing Returns
Zakaria, Khabbab
Jerinsh, Jayapaulraj
Maier, Andreas
Krauss, Patrick
Pasquali, Stefano
Mehta, Dhagash
Computational Finance
Optimal Order Execution is a well-established problem in finance that pertains to the flawless execution of a trade (buy or sell) for a given volume within a specified time frame. This problem revolves around optimizing returns while minimizing risk, yet recent research predominantly focuses on addressing one aspect of this challenge. In this paper, we introduce an innovative approach to Optimal Order Execution within the US market, leveraging Deep Reinforcement Learning (DRL) to effectively address this optimization problem holistically. Our study assesses the performance of our model in comparison to two widely employed execution strategies: Volume Weighted Average Price (VWAP) and Time Weighted Average Price (TWAP). Our experimental findings clearly demonstrate that our DRL-based approach outperforms both VWAP and TWAP in terms of return on investment and risk management. The model's ability to adapt dynamically to market conditions, even during periods of market stress, underscores its promise as a robust solution.
title Deep Reinforcement Learning for Optimum Order Execution: Mitigating Risk and Maximizing Returns
topic Computational Finance
url https://arxiv.org/abs/2601.04896