Saved in:
Bibliographic Details
Main Authors: Iida, Satoko, Yasudo, Ryota
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.00539
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916186496696320
author Iida, Satoko
Yasudo, Ryota
author_facet Iida, Satoko
Yasudo, Ryota
contents Quadratic Assignment Problem (QAP) is a practical combinatorial optimization problems that has been studied for several years. Since it is NP-hard, solving large problem instances of QAP is challenging. Although heuristics can find semi-optimal solutions, the execution time significantly increases as the problem size increases. Recently, solving combinatorial optimization problems by deep learning has been attracting attention as a faster solver than heuristics. Even with deep learning, however, solving large QAP is still challenging. In this paper, we propose the deep reinforcement learning model called the two-stage graph pointer network (GPN) for solving QAP. Two-stage GPN relies on GPN, which has been proposed for Euclidean Traveling Salesman Problem (TSP). First, we extend GPN for general TSP, and then we add new algorithms to that model for solving QAP. Our experimental results show that our two-stage GPN provides semi-optimal solutions for benchmark problem instances from TSPlib and QAPLIB.
format Preprint
id arxiv_https___arxiv_org_abs_2404_00539
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Solving the QAP by Two-Stage Graph Pointer Networks and Reinforcement Learning
Iida, Satoko
Yasudo, Ryota
Machine Learning
Quadratic Assignment Problem (QAP) is a practical combinatorial optimization problems that has been studied for several years. Since it is NP-hard, solving large problem instances of QAP is challenging. Although heuristics can find semi-optimal solutions, the execution time significantly increases as the problem size increases. Recently, solving combinatorial optimization problems by deep learning has been attracting attention as a faster solver than heuristics. Even with deep learning, however, solving large QAP is still challenging. In this paper, we propose the deep reinforcement learning model called the two-stage graph pointer network (GPN) for solving QAP. Two-stage GPN relies on GPN, which has been proposed for Euclidean Traveling Salesman Problem (TSP). First, we extend GPN for general TSP, and then we add new algorithms to that model for solving QAP. Our experimental results show that our two-stage GPN provides semi-optimal solutions for benchmark problem instances from TSPlib and QAPLIB.
title Solving the QAP by Two-Stage Graph Pointer Networks and Reinforcement Learning
topic Machine Learning
url https://arxiv.org/abs/2404.00539