Saved in:
Bibliographic Details
Main Authors: Mohammadi, Ava, Kramer, Rick, Nagy, Zoltan
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.01362
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913083538014208
author Mohammadi, Ava
Kramer, Rick
Nagy, Zoltan
author_facet Mohammadi, Ava
Kramer, Rick
Nagy, Zoltan
contents Grid-integrated building districts must provide energy flexibility while preserving occupant comfort and equitable distribution of control burden. We study how coordination architecture influences the ability of building clusters to track aggregated load profiles, comparing four paradigms: centralized model predictive control (MPC), decentralized independent reinforcement learning (SAC), centralized-training-decentralized-execution multi-agent RL (MAPPO), and a hybrid MPC--SAC controller that separates district-level battery optimization from building-level HVAC regulation. A rule-based controller serves as a baseline. We evaluate a 25-building residential district across three metrics: aggregate load tracking, thermal comfort, and spatial variability of control actions. We find that architecture choice determines the trade-off structure. Centralized MPC achieves low tracking bias (8.8% NMBE) but concentrates actuation on a subset of buildings, causing elevated comfort violations (24.8% exceedance) and spatial imbalance. Decentralized RL distributes control effort more evenly but fails to sustain accurate tracking. The hybrid architecture achieves the best balance: accurate tracking (4.8% NMBE), moderate comfort impact (16.8% exceedance), and the lowest spatial variability. These findings demonstrate that architecture choice determines the trade-off structure between tracking and comfort.
format Preprint
id arxiv_https___arxiv_org_abs_2605_01362
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Coordination Architecture Shapes Continuous Demand Response Outcomes in Building Districts
Mohammadi, Ava
Kramer, Rick
Nagy, Zoltan
Systems and Control
Grid-integrated building districts must provide energy flexibility while preserving occupant comfort and equitable distribution of control burden. We study how coordination architecture influences the ability of building clusters to track aggregated load profiles, comparing four paradigms: centralized model predictive control (MPC), decentralized independent reinforcement learning (SAC), centralized-training-decentralized-execution multi-agent RL (MAPPO), and a hybrid MPC--SAC controller that separates district-level battery optimization from building-level HVAC regulation. A rule-based controller serves as a baseline. We evaluate a 25-building residential district across three metrics: aggregate load tracking, thermal comfort, and spatial variability of control actions. We find that architecture choice determines the trade-off structure. Centralized MPC achieves low tracking bias (8.8% NMBE) but concentrates actuation on a subset of buildings, causing elevated comfort violations (24.8% exceedance) and spatial imbalance. Decentralized RL distributes control effort more evenly but fails to sustain accurate tracking. The hybrid architecture achieves the best balance: accurate tracking (4.8% NMBE), moderate comfort impact (16.8% exceedance), and the lowest spatial variability. These findings demonstrate that architecture choice determines the trade-off structure between tracking and comfort.
title Coordination Architecture Shapes Continuous Demand Response Outcomes in Building Districts
topic Systems and Control
url https://arxiv.org/abs/2605.01362