Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jia, Ziqi, Li, Junjie, Qu, Xiaoyang, Wang, Jianzong
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.10049
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929757566795776
author	Jia, Ziqi Li, Junjie Qu, Xiaoyang Wang, Jianzong
author_facet	Jia, Ziqi Li, Junjie Qu, Xiaoyang Wang, Jianzong
contents	Multi-agent systems (MAS) have shown great potential in executing complex tasks, but coordination and safety remain significant challenges. Multi-Agent Reinforcement Learning (MARL) offers a promising framework for agent collaboration, but it faces difficulties in handling complex tasks and designing reward functions. The introduction of Large Language Models (LLMs) has brought stronger reasoning and cognitive abilities to MAS, but existing LLM-based systems struggle to respond quickly and accurately in dynamic environments. To address these challenges, we propose LLM-based Graph Collaboration MARL (LGC-MARL), a framework that efficiently combines LLMs and MARL. This framework decomposes complex tasks into executable subtasks and achieves efficient collaboration among multiple agents through graph-based coordination. Specifically, LGC-MARL consists of two main components: an LLM planner and a graph-based collaboration meta policy. The LLM planner transforms complex task instructions into a series of executable subtasks, evaluates the rationality of these subtasks using a critic model, and generates an action dependency graph. The graph-based collaboration meta policy facilitates communication and collaboration among agents based on the action dependency graph, and adapts to new task environments through meta-learning. Experimental results on the AI2-THOR simulation platform demonstrate the superior performance and scalability of LGC-MARL in completing various complex tasks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_10049
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy Jia, Ziqi Li, Junjie Qu, Xiaoyang Wang, Jianzong Computer Vision and Pattern Recognition Multi-agent systems (MAS) have shown great potential in executing complex tasks, but coordination and safety remain significant challenges. Multi-Agent Reinforcement Learning (MARL) offers a promising framework for agent collaboration, but it faces difficulties in handling complex tasks and designing reward functions. The introduction of Large Language Models (LLMs) has brought stronger reasoning and cognitive abilities to MAS, but existing LLM-based systems struggle to respond quickly and accurately in dynamic environments. To address these challenges, we propose LLM-based Graph Collaboration MARL (LGC-MARL), a framework that efficiently combines LLMs and MARL. This framework decomposes complex tasks into executable subtasks and achieves efficient collaboration among multiple agents through graph-based coordination. Specifically, LGC-MARL consists of two main components: an LLM planner and a graph-based collaboration meta policy. The LLM planner transforms complex task instructions into a series of executable subtasks, evaluates the rationality of these subtasks using a critic model, and generates an action dependency graph. The graph-based collaboration meta policy facilitates communication and collaboration among agents based on the action dependency graph, and adapts to new task environments through meta-learning. Experimental results on the AI2-THOR simulation platform demonstrate the superior performance and scalability of LGC-MARL in completing various complex tasks.
title	Enhancing Multi-Agent Systems via Reinforcement Learning with LLM-based Planner and Graph-based Policy
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2503.10049

Similar Items