Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Xiaoqi, Du, Hongyang, Gao, Yuehong, Kim, Dong In
Format:	Preprint
Published:	2025
Subjects:	Systems and Control Machine Learning
Online Access:	https://arxiv.org/abs/2503.04418
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909527787438080
author	Wang, Xiaoqi Du, Hongyang Gao, Yuehong Kim, Dong In
author_facet	Wang, Xiaoqi Du, Hongyang Gao, Yuehong Kim, Dong In
contents	Recent advancements in large language models (LLMs) have led to their widespread adoption and large-scale deployment across various domains. However, their environmental impact, particularly during inference, has become a growing concern due to their substantial energy consumption and carbon footprint. Existing research has focused on inference computation alone, overlooking the analysis and optimization of carbon footprint in network-aided LLM service systems. To address this gap, we propose AOLO, a framework for analysis and optimization for low-carbon oriented wireless LLM services. AOLO introduces a comprehensive carbon footprint model that quantifies greenhouse gas emissions across the entire LLM service chain, including computational inference and wireless communication. Furthermore, we formulate an optimization problem aimed at minimizing the overall carbon footprint, which is solved through joint optimization of inference outputs and transmit power under quality-of-experience and system performance constraints. To achieve this joint optimization, we leverage the energy efficiency of spiking neural networks (SNNs) by adopting SNN as the actor network and propose a low-carbon-oriented optimization algorithm, i.e., SNN-based deep reinforcement learning (SDRL). Comprehensive simulations demonstrate that SDRL algorithm significantly reduces overall carbon footprint, achieving an 18.77% reduction compared to the benchmark soft actor-critic, highlighting its potential for enabling more sustainable LLM inference services.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_04418
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services Wang, Xiaoqi Du, Hongyang Gao, Yuehong Kim, Dong In Systems and Control Machine Learning Recent advancements in large language models (LLMs) have led to their widespread adoption and large-scale deployment across various domains. However, their environmental impact, particularly during inference, has become a growing concern due to their substantial energy consumption and carbon footprint. Existing research has focused on inference computation alone, overlooking the analysis and optimization of carbon footprint in network-aided LLM service systems. To address this gap, we propose AOLO, a framework for analysis and optimization for low-carbon oriented wireless LLM services. AOLO introduces a comprehensive carbon footprint model that quantifies greenhouse gas emissions across the entire LLM service chain, including computational inference and wireless communication. Furthermore, we formulate an optimization problem aimed at minimizing the overall carbon footprint, which is solved through joint optimization of inference outputs and transmit power under quality-of-experience and system performance constraints. To achieve this joint optimization, we leverage the energy efficiency of spiking neural networks (SNNs) by adopting SNN as the actor network and propose a low-carbon-oriented optimization algorithm, i.e., SNN-based deep reinforcement learning (SDRL). Comprehensive simulations demonstrate that SDRL algorithm significantly reduces overall carbon footprint, achieving an 18.77% reduction compared to the benchmark soft actor-critic, highlighting its potential for enabling more sustainable LLM inference services.
title	AOLO: Analysis and Optimization For Low-Carbon Oriented Wireless Large Language Model Services
topic	Systems and Control Machine Learning
url	https://arxiv.org/abs/2503.04418

Similar Items