Saved in:
Bibliographic Details
Main Authors: Liu, Yang, Sun, Peng, Li, Hang
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.08078
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929241957859328
author Liu, Yang
Sun, Peng
Li, Hang
author_facet Liu, Yang
Sun, Peng
Li, Hang
contents By formally defining the training processes of large language models (LLMs), which usually encompasses pre-training, supervised fine-tuning, and reinforcement learning with human feedback, within a single and unified machine learning paradigm, we can glean pivotal insights for advancing LLM technologies. This position paper delineates the parallels between the training methods of LLMs and the strategies employed for the development of agents in two-player games, as studied in game theory, reinforcement learning, and multi-agent systems. We propose a re-conceptualization of LLM learning processes in terms of agent learning in language-based games. This framework unveils innovative perspectives on the successes and challenges in LLM development, offering a fresh understanding of addressing alignment issues among other strategic considerations. Furthermore, our two-player game approach sheds light on novel data preparation and machine learning techniques for training LLMs.
format Preprint
id arxiv_https___arxiv_org_abs_2402_08078
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Large Language Models as Agents in Two-Player Games
Liu, Yang
Sun, Peng
Li, Hang
Computation and Language
Machine Learning
By formally defining the training processes of large language models (LLMs), which usually encompasses pre-training, supervised fine-tuning, and reinforcement learning with human feedback, within a single and unified machine learning paradigm, we can glean pivotal insights for advancing LLM technologies. This position paper delineates the parallels between the training methods of LLMs and the strategies employed for the development of agents in two-player games, as studied in game theory, reinforcement learning, and multi-agent systems. We propose a re-conceptualization of LLM learning processes in terms of agent learning in language-based games. This framework unveils innovative perspectives on the successes and challenges in LLM development, offering a fresh understanding of addressing alignment issues among other strategic considerations. Furthermore, our two-player game approach sheds light on novel data preparation and machine learning techniques for training LLMs.
title Large Language Models as Agents in Two-Player Games
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2402.08078