Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Lai, Matthew, Go, Keegan, Li, Zhibin, Kroger, Torsten, Schaal, Stefan, Allen, Kelsey, Scholz, Jonathan
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Robotics Machine Learning
Online-Zugang:	https://arxiv.org/abs/2509.05397
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866912573980409856
author	Lai, Matthew Go, Keegan Li, Zhibin Kroger, Torsten Schaal, Stefan Allen, Kelsey Scholz, Jonathan
author_facet	Lai, Matthew Go, Keegan Li, Zhibin Kroger, Torsten Schaal, Stefan Allen, Kelsey Scholz, Jonathan
contents	Modern robotic manufacturing requires collision-free coordination of multiple robots to complete numerous tasks in shared, obstacle-rich workspaces. Although individual tasks may be simple in isolation, automated joint task allocation, scheduling, and motion planning under spatio-temporal constraints remain computationally intractable for classical methods at real-world scales. Existing multi-arm systems deployed in the industry rely on human intuition and experience to design feasible trajectories manually in a labor-intensive process. To address this challenge, we propose a reinforcement learning (RL) framework to achieve automated task and motion planning, tested in an obstacle-rich environment with eight robots performing 40 reaching tasks in a shared workspace, where any robot can perform any task in any order. Our approach builds on a graph neural network (GNN) policy trained via RL on procedurally-generated environments with diverse obstacle layouts, robot configurations, and task distributions. It employs a graph representation of scenes and a graph policy neural network trained through reinforcement learning to generate trajectories of multiple robots, jointly solving the sub-problems of task allocation, scheduling, and motion planning. Trained on large randomly generated task sets in simulation, our policy generalizes zero-shot to unseen settings with varying robot placements, obstacle geometries, and task poses. We further demonstrate that the high-speed capability of our solution enables its use in workcell layout optimization, improving solution times. The speed and scalability of our planner also open the door to new capabilities such as fault-tolerant planning and online perception-based re-planning, where rapid adaptation to dynamic task sets is required.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_05397
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning Lai, Matthew Go, Keegan Li, Zhibin Kroger, Torsten Schaal, Stefan Allen, Kelsey Scholz, Jonathan Robotics Machine Learning Modern robotic manufacturing requires collision-free coordination of multiple robots to complete numerous tasks in shared, obstacle-rich workspaces. Although individual tasks may be simple in isolation, automated joint task allocation, scheduling, and motion planning under spatio-temporal constraints remain computationally intractable for classical methods at real-world scales. Existing multi-arm systems deployed in the industry rely on human intuition and experience to design feasible trajectories manually in a labor-intensive process. To address this challenge, we propose a reinforcement learning (RL) framework to achieve automated task and motion planning, tested in an obstacle-rich environment with eight robots performing 40 reaching tasks in a shared workspace, where any robot can perform any task in any order. Our approach builds on a graph neural network (GNN) policy trained via RL on procedurally-generated environments with diverse obstacle layouts, robot configurations, and task distributions. It employs a graph representation of scenes and a graph policy neural network trained through reinforcement learning to generate trajectories of multiple robots, jointly solving the sub-problems of task allocation, scheduling, and motion planning. Trained on large randomly generated task sets in simulation, our policy generalizes zero-shot to unseen settings with varying robot placements, obstacle geometries, and task poses. We further demonstrate that the high-speed capability of our solution enables its use in workcell layout optimization, improving solution times. The speed and scalability of our planner also open the door to new capabilities such as fault-tolerant planning and online perception-based re-planning, where rapid adaptation to dynamic task sets is required.
title	RoboBallet: Planning for Multi-Robot Reaching with Graph Neural Networks and Reinforcement Learning
topic	Robotics Machine Learning
url	https://arxiv.org/abs/2509.05397

Ähnliche Einträge