Saved in:
Bibliographic Details
Main Authors: Goto, Takeru, Kizumi, Yuki, Iwasaki, Shun
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.16559
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916658777423872
author Goto, Takeru
Kizumi, Yuki
Iwasaki, Shun
author_facet Goto, Takeru
Kizumi, Yuki
Iwasaki, Shun
contents This paper proposes a design scheme of reward function that constantly evaluates both driving states and actions for applying reinforcement learning to automated driving. In the field of reinforcement learning, reward functions often evaluate whether the goal is achieved by assigning values such as +1 for success and -1 for failure. This type of reward function can potentially obtain a policy that achieves the goal, but the process by which the goal is reached is not evaluated. However, process to reach a destination is important for automated driving, such as keeping velocity, avoiding risk, retaining distance from other cars, keeping comfortable for passengers. Therefore, the reward function designed by the proposed scheme is suited for automated driving by evaluating driving process. The effects of the proposed scheme are demonstrated on simulated circuit driving and highway cruising. Asynchronous Advantage Actor-Critic is used, and models are trained under some situations for generalization. The result shows that appropriate driving positions are obtained, such as traveling on the inside of corners, and rapid deceleration to turn along sharp curves. In highway cruising, the ego vehicle becomes able to change lane in an environment where there are other vehicles with suitable deceleration to avoid catching up to a front vehicle, and acceleration so that a rear vehicle does not catch up to the ego vehicle.
format Preprint
id arxiv_https___arxiv_org_abs_2503_16559
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Design of Reward Function on Reinforcement Learning for Automated Driving
Goto, Takeru
Kizumi, Yuki
Iwasaki, Shun
Robotics
This paper proposes a design scheme of reward function that constantly evaluates both driving states and actions for applying reinforcement learning to automated driving. In the field of reinforcement learning, reward functions often evaluate whether the goal is achieved by assigning values such as +1 for success and -1 for failure. This type of reward function can potentially obtain a policy that achieves the goal, but the process by which the goal is reached is not evaluated. However, process to reach a destination is important for automated driving, such as keeping velocity, avoiding risk, retaining distance from other cars, keeping comfortable for passengers. Therefore, the reward function designed by the proposed scheme is suited for automated driving by evaluating driving process. The effects of the proposed scheme are demonstrated on simulated circuit driving and highway cruising. Asynchronous Advantage Actor-Critic is used, and models are trained under some situations for generalization. The result shows that appropriate driving positions are obtained, such as traveling on the inside of corners, and rapid deceleration to turn along sharp curves. In highway cruising, the ego vehicle becomes able to change lane in an environment where there are other vehicles with suitable deceleration to avoid catching up to a front vehicle, and acceleration so that a rear vehicle does not catch up to the ego vehicle.
title Design of Reward Function on Reinforcement Learning for Automated Driving
topic Robotics
url https://arxiv.org/abs/2503.16559