Saved in:
Bibliographic Details
Main Authors: Roth, Manuel M. H., Hegde, Anupama, Delamotte, Thomas, Knopp, Andreas
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.01979
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910554896990208
author Roth, Manuel M. H.
Hegde, Anupama
Delamotte, Thomas
Knopp, Andreas
author_facet Roth, Manuel M. H.
Hegde, Anupama
Delamotte, Thomas
Knopp, Andreas
contents Effective routing in satellite mega-constellations has become crucial to facilitate the handling of increasing traffic loads, more complex network architectures, as well as the integration into 6G networks. To enhance adaptability as well as robustness to unpredictable traffic demands, and to solve dynamic routing environments efficiently, machine learning-based solutions are being considered. For network control problems, such as optimizing packet forwarding decisions according to Quality of Service requirements and maintaining network stability, deep reinforcement learning techniques have demonstrated promising results. For this reason, we investigate the viability of multi-agent deep Q-networks for routing in satellite constellation networks. We focus specifically on reward shaping and quantifying training convergence for joint optimization of latency and load balancing in static and dynamic scenarios. To address identified drawbacks, we propose a novel hybrid solution based on centralized learning and decentralized control.
format Preprint
id arxiv_https___arxiv_org_abs_2408_01979
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks
Roth, Manuel M. H.
Hegde, Anupama
Delamotte, Thomas
Knopp, Andreas
Networking and Internet Architecture
Machine Learning
C.2.1
Effective routing in satellite mega-constellations has become crucial to facilitate the handling of increasing traffic loads, more complex network architectures, as well as the integration into 6G networks. To enhance adaptability as well as robustness to unpredictable traffic demands, and to solve dynamic routing environments efficiently, machine learning-based solutions are being considered. For network control problems, such as optimizing packet forwarding decisions according to Quality of Service requirements and maintaining network stability, deep reinforcement learning techniques have demonstrated promising results. For this reason, we investigate the viability of multi-agent deep Q-networks for routing in satellite constellation networks. We focus specifically on reward shaping and quantifying training convergence for joint optimization of latency and load balancing in static and dynamic scenarios. To address identified drawbacks, we propose a novel hybrid solution based on centralized learning and decentralized control.
title Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks
topic Networking and Internet Architecture
Machine Learning
C.2.1
url https://arxiv.org/abs/2408.01979