Saved in:
Bibliographic Details
Main Authors: Dehury, Chinmaya Kumar, Kushwaha, Siddharth Singh, Zhang, Qiyang, Saleh, Alaa, Donta, Praveen Kumar
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.09607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911583979962368
author Dehury, Chinmaya Kumar
Kushwaha, Siddharth Singh
Zhang, Qiyang
Saleh, Alaa
Donta, Praveen Kumar
author_facet Dehury, Chinmaya Kumar
Kushwaha, Siddharth Singh
Zhang, Qiyang
Saleh, Alaa
Donta, Praveen Kumar
contents Edge intelligence delivers low-latency inference, yet most edge analytics remain hard-coded and must be redeployed as conditions change. When data patterns shift or new questions arise, engineers often need to write new scripts and push updates to devices, which slows iteration and raises operating costs. This limited adaptability reduces scalability and autonomy in large, heterogeneous, and resource-constrained edge deployments, and it increases reliance on human oversight. Meanwhile, large language models (LLMs) can interpret instructions and generate code, but their compute and memory requirements typically prevent direct deployment on edge devices. We address this gap with the LLM-assisted Edge Intelligence (LEI) framework, which removes the need for manually specified business logic. In LEI, a cloud-hosted LLM coordinates the creation and update of device-side logic as requirements evolve. The system generates candidate lightweight programs, checks them against available data and constraints, and then deploys the selected version to each device. This lets each device receive a tailored program based on sample data, metadata, context, and current resource limits. We evaluate LEI on four heterogeneous datasets, including air quality, temperature \& humidity, wind, and soil datasets using multiple LLM backends. The experimental results show that the framework maintains low average CPU and memory utilization during the execution. These results indicate that the framework adapts efficiently to changing conditions while maintaining resource efficiency.
format Preprint
id arxiv_https___arxiv_org_abs_2604_09607
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle LLM-assisted Agentic Edge Intelligence Framework
Dehury, Chinmaya Kumar
Kushwaha, Siddharth Singh
Zhang, Qiyang
Saleh, Alaa
Donta, Praveen Kumar
Distributed, Parallel, and Cluster Computing
Edge intelligence delivers low-latency inference, yet most edge analytics remain hard-coded and must be redeployed as conditions change. When data patterns shift or new questions arise, engineers often need to write new scripts and push updates to devices, which slows iteration and raises operating costs. This limited adaptability reduces scalability and autonomy in large, heterogeneous, and resource-constrained edge deployments, and it increases reliance on human oversight. Meanwhile, large language models (LLMs) can interpret instructions and generate code, but their compute and memory requirements typically prevent direct deployment on edge devices. We address this gap with the LLM-assisted Edge Intelligence (LEI) framework, which removes the need for manually specified business logic. In LEI, a cloud-hosted LLM coordinates the creation and update of device-side logic as requirements evolve. The system generates candidate lightweight programs, checks them against available data and constraints, and then deploys the selected version to each device. This lets each device receive a tailored program based on sample data, metadata, context, and current resource limits. We evaluate LEI on four heterogeneous datasets, including air quality, temperature \& humidity, wind, and soil datasets using multiple LLM backends. The experimental results show that the framework maintains low average CPU and memory utilization during the execution. These results indicate that the framework adapts efficiently to changing conditions while maintaining resource efficiency.
title LLM-assisted Agentic Edge Intelligence Framework
topic Distributed, Parallel, and Cluster Computing
url https://arxiv.org/abs/2604.09607