Saved in:
Bibliographic Details
Main Authors: Liu, Qingyuan, Yang, Yanning, Du, Dong, Xia, Yubin, Zhang, Ping, Feng, Jia, Larus, James, Chen, Haibo
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.00433
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914698217127936
author Liu, Qingyuan
Yang, Yanning
Du, Dong
Xia, Yubin
Zhang, Ping
Feng, Jia
Larus, James
Chen, Haibo
author_facet Liu, Qingyuan
Yang, Yanning
Du, Dong
Xia, Yubin
Zhang, Ping
Feng, Jia
Larus, James
Chen, Haibo
contents Current serverless platforms struggle to optimize resource utilization due to their dynamic and fine-grained nature. Conventional techniques like overcommitment and autoscaling fall short, often sacrificing utilization for practicability or incurring performance trade-offs. Overcommitment requires predicting performance to prevent QoS violation, introducing trade-off between prediction accuracy and overheads. Autoscaling requires scaling instances in response to load fluctuations quickly to reduce resource wastage, but more frequent scaling also leads to more cold start overheads. This paper introduces Jiagu, which harmonizes efficiency with practicability through two novel techniques. First, pre-decision scheduling achieves accurate prediction while eliminating overheads by decoupling prediction and scheduling. Second, dual-staged scaling achieves frequent adjustment of instances with minimum overhead. We have implemented a prototype and evaluated it using real-world applications and traces from the public cloud platform. Our evaluation shows a 54.8% improvement in deployment density over commercial clouds (with Kubernetes) while maintaining QoS, and 81.0%--93.7% lower scheduling costs and a 57.4%--69.3% reduction in cold start latency compared to existing QoS-aware schedulers in research work.
format Preprint
id arxiv_https___arxiv_org_abs_2403_00433
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Jiagu: Optimizing Serverless Computing Resource Utilization with Harmonized Efficiency and Practicability
Liu, Qingyuan
Yang, Yanning
Du, Dong
Xia, Yubin
Zhang, Ping
Feng, Jia
Larus, James
Chen, Haibo
Distributed, Parallel, and Cluster Computing
Current serverless platforms struggle to optimize resource utilization due to their dynamic and fine-grained nature. Conventional techniques like overcommitment and autoscaling fall short, often sacrificing utilization for practicability or incurring performance trade-offs. Overcommitment requires predicting performance to prevent QoS violation, introducing trade-off between prediction accuracy and overheads. Autoscaling requires scaling instances in response to load fluctuations quickly to reduce resource wastage, but more frequent scaling also leads to more cold start overheads. This paper introduces Jiagu, which harmonizes efficiency with practicability through two novel techniques. First, pre-decision scheduling achieves accurate prediction while eliminating overheads by decoupling prediction and scheduling. Second, dual-staged scaling achieves frequent adjustment of instances with minimum overhead. We have implemented a prototype and evaluated it using real-world applications and traces from the public cloud platform. Our evaluation shows a 54.8% improvement in deployment density over commercial clouds (with Kubernetes) while maintaining QoS, and 81.0%--93.7% lower scheduling costs and a 57.4%--69.3% reduction in cold start latency compared to existing QoS-aware schedulers in research work.
title Jiagu: Optimizing Serverless Computing Resource Utilization with Harmonized Efficiency and Practicability
topic Distributed, Parallel, and Cluster Computing
url https://arxiv.org/abs/2403.00433