Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Fangqi, Sen, Rishav, Talusan, Jose Paolo, Pettet, Ava, Kandel, Aaron, Suzue, Yoshinori, Mukhopadhyay, Ayan, Dubey, Abhishek
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.18526
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912247093133312
author	Liu, Fangqi Sen, Rishav Talusan, Jose Paolo Pettet, Ava Kandel, Aaron Suzue, Yoshinori Mukhopadhyay, Ayan Dubey, Abhishek
author_facet	Liu, Fangqi Sen, Rishav Talusan, Jose Paolo Pettet, Ava Kandel, Aaron Suzue, Yoshinori Mukhopadhyay, Ayan Dubey, Abhishek
contents	Strategic aggregation of electric vehicle batteries as energy reservoirs can optimize power grid demand, benefiting smart and connected communities, especially large office buildings that offer workplace charging. This involves optimizing charging and discharging to reduce peak energy costs and net peak demand, monitored over extended periods (e.g., a month), which involves making sequential decisions under uncertainty and delayed and sparse rewards, a continuous action space, and the complexity of ensuring generalization across diverse conditions. Existing algorithmic approaches, e.g., heuristic-based strategies, fall short in addressing real-time decision-making under dynamic conditions, and traditional reinforcement learning (RL) models struggle with large state-action spaces, multi-agent settings, and the need for long-term reward optimization. To address these challenges, we introduce a novel RL framework that combines the Deep Deterministic Policy Gradient approach (DDPG) with action masking and efficient MILP-driven policy guidance. Our approach balances the exploration of continuous action spaces to meet user charging demands. Using real-world data from a major electric vehicle manufacturer, we show that our approach comprehensively outperforms many well-established baselines and several scalable heuristic approaches, achieving significant cost savings while meeting all charging requirements. Our results show that the proposed approach is one of the first scalable and general approaches to solving the V2B energy management challenge.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_18526
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards Liu, Fangqi Sen, Rishav Talusan, Jose Paolo Pettet, Ava Kandel, Aaron Suzue, Yoshinori Mukhopadhyay, Ayan Dubey, Abhishek Machine Learning Artificial Intelligence Strategic aggregation of electric vehicle batteries as energy reservoirs can optimize power grid demand, benefiting smart and connected communities, especially large office buildings that offer workplace charging. This involves optimizing charging and discharging to reduce peak energy costs and net peak demand, monitored over extended periods (e.g., a month), which involves making sequential decisions under uncertainty and delayed and sparse rewards, a continuous action space, and the complexity of ensuring generalization across diverse conditions. Existing algorithmic approaches, e.g., heuristic-based strategies, fall short in addressing real-time decision-making under dynamic conditions, and traditional reinforcement learning (RL) models struggle with large state-action spaces, multi-agent settings, and the need for long-term reward optimization. To address these challenges, we introduce a novel RL framework that combines the Deep Deterministic Policy Gradient approach (DDPG) with action masking and efficient MILP-driven policy guidance. Our approach balances the exploration of continuous action spaces to meet user charging demands. Using real-world data from a major electric vehicle manufacturer, we show that our approach comprehensively outperforms many well-established baselines and several scalable heuristic approaches, achieving significant cost savings while meeting all charging requirements. Our results show that the proposed approach is one of the first scalable and general approaches to solving the V2B energy management challenge.
title	Reinforcement Learning-based Approach for Vehicle-to-Building Charging with Heterogeneous Agents and Long Term Rewards
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2502.18526

Similar Items