Saved in:
Bibliographic Details
Main Authors: Abdelkareem, Youssef, Shehata, Shady, Karray, Fakhri
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.11943
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913476546396160
author Abdelkareem, Youssef
Shehata, Shady
Karray, Fakhri
author_facet Abdelkareem, Youssef
Shehata, Shady
Karray, Fakhri
contents Reinforcement Learning (RL) algorithms suffer from the dependency on accurately engineered reward functions to properly guide the learning agents to do the required tasks. Preference-based reinforcement learning (PbRL) addresses that by utilizing human preferences as feedback from the experts instead of numeric rewards. Due to its promising advantage over traditional RL, PbRL has gained more focus in recent years with many significant advances. In this survey, we present a unified PbRL framework to include the newly emerging approaches that improve the scalability and efficiency of PbRL. In addition, we give a detailed overview of the theoretical guarantees and benchmarking work done in the field, while presenting its recent applications in complex real-world tasks. Lastly, we go over the limitations of the current approaches and the proposed future research directions.
format Preprint
id arxiv_https___arxiv_org_abs_2408_11943
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Advances in Preference-based Reinforcement Learning: A Review
Abdelkareem, Youssef
Shehata, Shady
Karray, Fakhri
Artificial Intelligence
Reinforcement Learning (RL) algorithms suffer from the dependency on accurately engineered reward functions to properly guide the learning agents to do the required tasks. Preference-based reinforcement learning (PbRL) addresses that by utilizing human preferences as feedback from the experts instead of numeric rewards. Due to its promising advantage over traditional RL, PbRL has gained more focus in recent years with many significant advances. In this survey, we present a unified PbRL framework to include the newly emerging approaches that improve the scalability and efficiency of PbRL. In addition, we give a detailed overview of the theoretical guarantees and benchmarking work done in the field, while presenting its recent applications in complex real-world tasks. Lastly, we go over the limitations of the current approaches and the proposed future research directions.
title Advances in Preference-based Reinforcement Learning: A Review
topic Artificial Intelligence
url https://arxiv.org/abs/2408.11943