Saved in:
Bibliographic Details
Main Authors: Pugliese, Victor Ulisses, Ferreira, Oséias F. de A., Faria, Fabio A.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.17573
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910888755200000
author Pugliese, Victor Ulisses
Ferreira, Oséias F. de A.
Faria, Fabio A.
author_facet Pugliese, Victor Ulisses
Ferreira, Oséias F. de A.
Faria, Fabio A.
contents This paper proposes a novel approach based on deep reinforcement learning (DRL) for the 2D+1 packing problem with spatial constraints. This problem is an extension of the traditional 2D packing problem, incorporating an additional constraint on the height dimension. Therefore, a simulator using the OpenAI Gym framework has been developed to efficiently simulate the packing of rectangular pieces onto two boards with height constraints. Furthermore, the simulator supports multidiscrete actions, enabling the selection of a position on either board and the type of piece to place. Finally, two DRL-based methods (Proximal Policy Optimization -- PPO and the Advantage Actor-Critic -- A2C) have been employed to learn a packing strategy and demonstrate its performance compared to a well-known heuristic baseline (MaxRect-BL). In the experiments carried out, the PPO-based approach proved to be a good solution for solving complex packaging problems and highlighted its potential to optimize resource utilization in various industrial applications, such as the manufacturing of aerospace composites.
format Preprint
id arxiv_https___arxiv_org_abs_2503_17573
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Optimizing 2D+1 Packing in Constrained Environments Using Deep Reinforcement Learning
Pugliese, Victor Ulisses
Ferreira, Oséias F. de A.
Faria, Fabio A.
Machine Learning
This paper proposes a novel approach based on deep reinforcement learning (DRL) for the 2D+1 packing problem with spatial constraints. This problem is an extension of the traditional 2D packing problem, incorporating an additional constraint on the height dimension. Therefore, a simulator using the OpenAI Gym framework has been developed to efficiently simulate the packing of rectangular pieces onto two boards with height constraints. Furthermore, the simulator supports multidiscrete actions, enabling the selection of a position on either board and the type of piece to place. Finally, two DRL-based methods (Proximal Policy Optimization -- PPO and the Advantage Actor-Critic -- A2C) have been employed to learn a packing strategy and demonstrate its performance compared to a well-known heuristic baseline (MaxRect-BL). In the experiments carried out, the PPO-based approach proved to be a good solution for solving complex packaging problems and highlighted its potential to optimize resource utilization in various industrial applications, such as the manufacturing of aerospace composites.
title Optimizing 2D+1 Packing in Constrained Environments Using Deep Reinforcement Learning
topic Machine Learning
url https://arxiv.org/abs/2503.17573