Saved in:
Bibliographic Details
Main Author: Popescu, Diana Andreea
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2408.09497
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910574533672960
author Popescu, Diana Andreea
author_facet Popescu, Diana Andreea
contents Data centers have become ubiquitous for today's businesses. From banks to startups, they rely on cloud infrastructure to deploy user applications. In this context, it is vital to provide users with application performance guarantees. Network interference is one of the causes of unpredictable application performance, and many solutions have been proposed over the years. The main objective of this survey is to familiarize the reader with research into network measurement-based resource allocation and control in data centers, focusing on network resources in order to provide cloud performance guarantees. We start with a primer on general network measurement techniques and data center network and applications to give the reader context. We then summarize the characteristics of network traffic and cluster workloads in data centers, which are pivotal for measurement-based allocation and control. We study and compare network monitoring in data centers, giving an overview on their evolution from Software-Defined Networking (SDN) to programmable dataplanes-based. The network monitoring information can serve as input to cluster allocation and scheduling decisions. We next categorize cluster scheduling frameworks, and perform an analysis of those that provide network guarantees in data centers, and we also look at emergent Machine Learning-driven resource allocation and control. We conclude with a discussion about future research directions.
format Preprint
id arxiv_https___arxiv_org_abs_2408_09497
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Measurement-based Resource Allocation and Control in Data Centers: A Survey
Popescu, Diana Andreea
Networking and Internet Architecture
Data centers have become ubiquitous for today's businesses. From banks to startups, they rely on cloud infrastructure to deploy user applications. In this context, it is vital to provide users with application performance guarantees. Network interference is one of the causes of unpredictable application performance, and many solutions have been proposed over the years. The main objective of this survey is to familiarize the reader with research into network measurement-based resource allocation and control in data centers, focusing on network resources in order to provide cloud performance guarantees. We start with a primer on general network measurement techniques and data center network and applications to give the reader context. We then summarize the characteristics of network traffic and cluster workloads in data centers, which are pivotal for measurement-based allocation and control. We study and compare network monitoring in data centers, giving an overview on their evolution from Software-Defined Networking (SDN) to programmable dataplanes-based. The network monitoring information can serve as input to cluster allocation and scheduling decisions. We next categorize cluster scheduling frameworks, and perform an analysis of those that provide network guarantees in data centers, and we also look at emergent Machine Learning-driven resource allocation and control. We conclude with a discussion about future research directions.
title Measurement-based Resource Allocation and Control in Data Centers: A Survey
topic Networking and Internet Architecture
url https://arxiv.org/abs/2408.09497