Saved in:
Bibliographic Details
Main Authors: Liang, Zhirui, Chung, Jae-Won, Chowdhury, Mosharaf, Chen, Jiasi, Dvorkin, Vladimir
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.05116
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917249036582912
author Liang, Zhirui
Chung, Jae-Won
Chowdhury, Mosharaf
Chen, Jiasi
Dvorkin, Vladimir
author_facet Liang, Zhirui
Chung, Jae-Won
Chowdhury, Mosharaf
Chen, Jiasi
Dvorkin, Vladimir
contents While the rapid expansion of data centers poses challenges for power grids, it also offers new opportunities as potentially flexible loads. Existing power system research often abstracts data centers as aggregate resources, while computer system research primarily focuses on optimizing GPU energy efficiency and largely ignores the grid impacts of optimized GPU power consumption. To bridge this gap, we develop a GPU-to-Grid framework that couples device-level GPU control with power system objectives. We study distribution-level voltage regulation enabled by flexibility in LLM inference, using batch size as a control knob that trades off the voltage impacts of GPU power consumption against inference latency and token throughput. We first formulate this problem as an optimization problem and then realize it as an online feedback optimization controller that leverages measurements from both the power grid and GPU systems. Our key insight is that reducing GPU power consumption alleviates violations of lower voltage limits, while increasing GPU power mitigates violations near upper voltage limits in distribution systems; this runs counter to the common belief that minimizing GPU power consumption is always beneficial to power grids.
format Preprint
id arxiv_https___arxiv_org_abs_2602_05116
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle GPU-to-Grid: Voltage Regulation via GPU Utilization Control
Liang, Zhirui
Chung, Jae-Won
Chowdhury, Mosharaf
Chen, Jiasi
Dvorkin, Vladimir
Systems and Control
While the rapid expansion of data centers poses challenges for power grids, it also offers new opportunities as potentially flexible loads. Existing power system research often abstracts data centers as aggregate resources, while computer system research primarily focuses on optimizing GPU energy efficiency and largely ignores the grid impacts of optimized GPU power consumption. To bridge this gap, we develop a GPU-to-Grid framework that couples device-level GPU control with power system objectives. We study distribution-level voltage regulation enabled by flexibility in LLM inference, using batch size as a control knob that trades off the voltage impacts of GPU power consumption against inference latency and token throughput. We first formulate this problem as an optimization problem and then realize it as an online feedback optimization controller that leverages measurements from both the power grid and GPU systems. Our key insight is that reducing GPU power consumption alleviates violations of lower voltage limits, while increasing GPU power mitigates violations near upper voltage limits in distribution systems; this runs counter to the common belief that minimizing GPU power consumption is always beneficial to power grids.
title GPU-to-Grid: Voltage Regulation via GPU Utilization Control
topic Systems and Control
url https://arxiv.org/abs/2602.05116