Saved in:
Bibliographic Details
Main Authors: Luo, Shuang, Li, Yinchuan, Liu, Shunyu, Zhang, Xu, Shao, Yunfeng, Wu, Chao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.06920
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929457865949184
author Luo, Shuang
Li, Yinchuan
Liu, Shunyu
Zhang, Xu
Shao, Yunfeng
Wu, Chao
author_facet Luo, Shuang
Li, Yinchuan
Liu, Shunyu
Zhang, Xu
Shao, Yunfeng
Wu, Chao
contents Generative Flow Networks (GFlowNets) aim to generate diverse trajectories from a distribution in which the final states of the trajectories are proportional to the reward, serving as a powerful alternative to reinforcement learning for exploratory control tasks. However, the individual-flow matching constraint in GFlowNets limits their applications for multi-agent systems, especially continuous joint-control problems. In this paper, we propose a novel Multi-Agent generative Continuous Flow Networks (MACFN) method to enable multiple agents to perform cooperative exploration for various compositional continuous objects. Technically, MACFN trains decentralized individual-flow-based policies in a centralized global-flow-based matching fashion. During centralized training, MACFN introduces a continuous flow decomposition network to deduce the flow contributions of each agent in the presence of only global rewards. Then agents can deliver actions solely based on their assigned local flow in a decentralized way, forming a joint policy distribution proportional to the rewards. To guarantee the expressiveness of continuous flow decomposition, we theoretically derive a consistency condition on the decomposition network. Experimental results demonstrate that the proposed method yields results superior to the state-of-the-art counterparts and better exploration capability. Our code is available at https://github.com/isluoshuang/MACFN.
format Preprint
id arxiv_https___arxiv_org_abs_2408_06920
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Multi-Agent Continuous Control with Generative Flow Networks
Luo, Shuang
Li, Yinchuan
Liu, Shunyu
Zhang, Xu
Shao, Yunfeng
Wu, Chao
Artificial Intelligence
Multiagent Systems
Generative Flow Networks (GFlowNets) aim to generate diverse trajectories from a distribution in which the final states of the trajectories are proportional to the reward, serving as a powerful alternative to reinforcement learning for exploratory control tasks. However, the individual-flow matching constraint in GFlowNets limits their applications for multi-agent systems, especially continuous joint-control problems. In this paper, we propose a novel Multi-Agent generative Continuous Flow Networks (MACFN) method to enable multiple agents to perform cooperative exploration for various compositional continuous objects. Technically, MACFN trains decentralized individual-flow-based policies in a centralized global-flow-based matching fashion. During centralized training, MACFN introduces a continuous flow decomposition network to deduce the flow contributions of each agent in the presence of only global rewards. Then agents can deliver actions solely based on their assigned local flow in a decentralized way, forming a joint policy distribution proportional to the rewards. To guarantee the expressiveness of continuous flow decomposition, we theoretically derive a consistency condition on the decomposition network. Experimental results demonstrate that the proposed method yields results superior to the state-of-the-art counterparts and better exploration capability. Our code is available at https://github.com/isluoshuang/MACFN.
title Multi-Agent Continuous Control with Generative Flow Networks
topic Artificial Intelligence
Multiagent Systems
url https://arxiv.org/abs/2408.06920