Guardado en:
Detalles Bibliográficos
Autores principales: Jin, Xuanfa, Wang, Ziyan, Du, Yali, Fang, Meng, Zhang, Haifeng, Wang, Jun
Formato: Preprint
Publicado: 2024
Materias:
Acceso en línea:https://arxiv.org/abs/2405.19946
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866910780112240640
author Jin, Xuanfa
Wang, Ziyan
Du, Yali
Fang, Meng
Zhang, Haifeng
Wang, Jun
author_facet Jin, Xuanfa
Wang, Ziyan
Du, Yali
Fang, Meng
Zhang, Haifeng
Wang, Jun
contents Communication is a fundamental aspect of human society, facilitating the exchange of information and beliefs among people. Despite the advancements in large language models (LLMs), recent agents built with these often neglect the control over discussion tactics, which are essential in communication scenarios and games. As a variant of the famous communication game Werewolf, One Night Ultimate Werewolf (ONUW) requires players to develop strategic discussion policies due to the potential role changes that increase the uncertainty and complexity of the game. In this work, we first present the existence of the Perfect Bayesian Equilibria (PBEs) in two scenarios of the ONUW game: one with discussion and one without. The results showcase that the discussion greatly changes players' utilities by affecting their beliefs, emphasizing the significance of discussion tactics. Based on the insights obtained from the analyses, we propose an RL-instructed language agent framework, where a discussion policy trained by reinforcement learning (RL) is employed to determine appropriate discussion tactics to adopt. Our experimental results on several ONUW game settings demonstrate the effectiveness and generalizability of our proposed framework. The project page of our paper: $\href{https://one-night-ultimate-werewolf.github.io}{one-night-ultimate-werewolf.github.io}$.
format Preprint
id arxiv_https___arxiv_org_abs_2405_19946
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
Jin, Xuanfa
Wang, Ziyan
Du, Yali
Fang, Meng
Zhang, Haifeng
Wang, Jun
Artificial Intelligence
Communication is a fundamental aspect of human society, facilitating the exchange of information and beliefs among people. Despite the advancements in large language models (LLMs), recent agents built with these often neglect the control over discussion tactics, which are essential in communication scenarios and games. As a variant of the famous communication game Werewolf, One Night Ultimate Werewolf (ONUW) requires players to develop strategic discussion policies due to the potential role changes that increase the uncertainty and complexity of the game. In this work, we first present the existence of the Perfect Bayesian Equilibria (PBEs) in two scenarios of the ONUW game: one with discussion and one without. The results showcase that the discussion greatly changes players' utilities by affecting their beliefs, emphasizing the significance of discussion tactics. Based on the insights obtained from the analyses, we propose an RL-instructed language agent framework, where a discussion policy trained by reinforcement learning (RL) is employed to determine appropriate discussion tactics to adopt. Our experimental results on several ONUW game settings demonstrate the effectiveness and generalizability of our proposed framework. The project page of our paper: $\href{https://one-night-ultimate-werewolf.github.io}{one-night-ultimate-werewolf.github.io}$.
title Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf
topic Artificial Intelligence
url https://arxiv.org/abs/2405.19946