Saved in:
| Main Authors: | Shyam, Gopal Krishna, Bharti, Priyanka |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.08139 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms
by: Ji, Cheng, et al.
Published: (2025)
by: Ji, Cheng, et al.
Published: (2025)
Cloud-Based AI Systems: Leveraging Large Language Models for Intelligent Fault Detection and Autonomous Self-Healing
by: Ji, Cheng, et al.
Published: (2025)
by: Ji, Cheng, et al.
Published: (2025)
Analytically-Driven Resource Management for Cloud-Native Microservices
by: Zhang, Yanqi, et al.
Published: (2024)
by: Zhang, Yanqi, et al.
Published: (2024)
The AI_INFN Platform: Artificial Intelligence Development in the Cloud
by: Anderlini, Lucio, et al.
Published: (2025)
by: Anderlini, Lucio, et al.
Published: (2025)
AI-Driven Cloud Resource Optimization for Multi-Cluster Environments
by: Punniyamoorthy, Vinoth, et al.
Published: (2025)
by: Punniyamoorthy, Vinoth, et al.
Published: (2025)
Scalable Cloud-Native Architectures for Intelligent PMU Data Processing
by: Chockalingam, Nachiappan, et al.
Published: (2025)
by: Chockalingam, Nachiappan, et al.
Published: (2025)
Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
Adaptive AI-based Decentralized Resource Management in the Cloud-Edge Continuum
by: Li, Lanpei, et al.
Published: (2025)
by: Li, Lanpei, et al.
Published: (2025)
Safactory: A Scalable Agentic Infrastructure for Training Trustworthy Autonomous Intelligence
by: Chen, Xinquan, et al.
Published: (2026)
by: Chen, Xinquan, et al.
Published: (2026)
Optimized Cloud Resource Allocation Using Genetic Algorithms for Energy Efficiency and QoS Assurance
by: Panggabean, Caroline, et al.
Published: (2025)
by: Panggabean, Caroline, et al.
Published: (2025)
AI4EOSC: a Federated Cloud Platform for Artificial Intelligence in Scientific Research
by: Heredia, Ignacio, et al.
Published: (2025)
by: Heredia, Ignacio, et al.
Published: (2025)
Artificial Intelligence for Cost-Aware Resource Prediction in Big Data Pipelines
by: Goyal, Harshit
Published: (2025)
by: Goyal, Harshit
Published: (2025)
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
by: Gu, Yan, et al.
Published: (2025)
by: Gu, Yan, et al.
Published: (2025)
DGRAG: Distributed Graph-based Retrieval-Augmented Generation in Edge-Cloud Systems
by: Zhou, Wenqing, et al.
Published: (2025)
by: Zhou, Wenqing, et al.
Published: (2025)
ECCENTRIC: Edge-Cloud Collaboration Framework for Distributed Inference Using Knowledge Adaptation
by: Kamani, Mohammad Mahdi, et al.
Published: (2025)
by: Kamani, Mohammad Mahdi, et al.
Published: (2025)
Ensemble Method for System Failure Detection Using Large-Scale Telemetry Data
by: Mudgal, Priyanka, et al.
Published: (2024)
by: Mudgal, Priyanka, et al.
Published: (2024)
Edge-Cloud Collaborative Computing on Distributed Intelligence and Model Optimization: A Survey
by: Liu, Jing, et al.
Published: (2025)
by: Liu, Jing, et al.
Published: (2025)
A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs
by: Zhang, Chen, et al.
Published: (2026)
by: Zhang, Chen, et al.
Published: (2026)
Resource Slicing through Intelligent Orchestration of Energy-aware IoT services in Edge-Cloud Continuum
by: Shahid, Hafiz Faheem, et al.
Published: (2024)
by: Shahid, Hafiz Faheem, et al.
Published: (2024)
Distributed Inference on Mobile Edge and Cloud: A Data-Cartography based Clustering Approach
by: Bajpai, Divya Jyoti, et al.
Published: (2024)
by: Bajpai, Divya Jyoti, et al.
Published: (2024)
Efficient Multi-Model Orchestration for Self-Hosted Large Language Models
by: Vangala, Bhanu Prakash, et al.
Published: (2025)
by: Vangala, Bhanu Prakash, et al.
Published: (2025)
Towards using Reinforcement Learning for Scaling and Data Replication in Cloud Systems
by: Mokadem, Riad, et al.
Published: (2024)
by: Mokadem, Riad, et al.
Published: (2024)
HPRM: High-Performance Robotic Middleware for Intelligent Autonomous Systems
by: Kwok, Jacky, et al.
Published: (2024)
by: Kwok, Jacky, et al.
Published: (2024)
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud
by: Wang, Qinlong, et al.
Published: (2023)
by: Wang, Qinlong, et al.
Published: (2023)
Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?
by: Kim, Taeyoon, et al.
Published: (2026)
by: Kim, Taeyoon, et al.
Published: (2026)
Mind the Boundary: Stabilizing Gemini Enterprise A2A via a Cloud Run Hub Across Projects and Accounts
by: Morita, Takao
Published: (2026)
by: Morita, Takao
Published: (2026)
Towards Carbon-Aware Container Orchestration: Predicting Workload Energy Consumption with Federated Learning
by: Saad, Zainab, et al.
Published: (2025)
by: Saad, Zainab, et al.
Published: (2025)
Resource Allocation and Workload Scheduling for Large-Scale Distributed Deep Learning: A Survey
by: Liang, Feng, et al.
Published: (2024)
by: Liang, Feng, et al.
Published: (2024)
Quantifying Energy and Cost Benefits of Hybrid Edge Cloud: Analysis of Traditional and Agentic Workloads
by: Alamouti, Siavash
Published: (2025)
by: Alamouti, Siavash
Published: (2025)
CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation
by: Xu, Yifei, et al.
Published: (2023)
by: Xu, Yifei, et al.
Published: (2023)
IslandRun: Privacy-Aware Multi-Objective Orchestration for Distributed AI Inference
by: Malepati, Bala Siva Sai Akhil
Published: (2025)
by: Malepati, Bala Siva Sai Akhil
Published: (2025)
Intelligent Load Balancing in Cloud Computer Systems
by: Sliwko, Leszek
Published: (2025)
by: Sliwko, Leszek
Published: (2025)
OrchMLLM: Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training
by: Zheng, Yijie, et al.
Published: (2025)
by: Zheng, Yijie, et al.
Published: (2025)
Dynamic Resource Allocation for Virtual Machine Migration Optimization using Machine Learning
by: Gong, Yulu, et al.
Published: (2024)
by: Gong, Yulu, et al.
Published: (2024)
Building AI Agents for Autonomous Clouds: Challenges and Design Principles
by: Shetty, Manish, et al.
Published: (2024)
by: Shetty, Manish, et al.
Published: (2024)
DeF-DReL: Systematic Deployment of Serverless Functions in Fog and Cloud environments using Deep Reinforcement Learning
by: Dehury, Chinmaya Kumar, et al.
Published: (2021)
by: Dehury, Chinmaya Kumar, et al.
Published: (2021)
TD3-Sched: Learning to Orchestrate Container-based Cloud-Edge Resources via Distributed Reinforcement Learning
by: Song, Shengye, et al.
Published: (2025)
by: Song, Shengye, et al.
Published: (2025)
Thousand-GPU Large-Scale Training and Optimization Recipe for AI-Native Cloud Embodied Intelligence Infrastructure
by: Guo, Yongjian, et al.
Published: (2026)
by: Guo, Yongjian, et al.
Published: (2026)
MLCommons Cloud Masking Benchmark with Early Stopping
by: Chennamsetti, Varshitha, et al.
Published: (2023)
by: Chennamsetti, Varshitha, et al.
Published: (2023)
Research on the Application of Spark Streaming Real-Time Data Analysis System and large language model Intelligent Agents
by: Wang, Jialin, et al.
Published: (2024)
by: Wang, Jialin, et al.
Published: (2024)
Similar Items
-
Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms
by: Ji, Cheng, et al.
Published: (2025) -
Cloud-Based AI Systems: Leveraging Large Language Models for Intelligent Fault Detection and Autonomous Self-Healing
by: Ji, Cheng, et al.
Published: (2025) -
Analytically-Driven Resource Management for Cloud-Native Microservices
by: Zhang, Yanqi, et al.
Published: (2024) -
The AI_INFN Platform: Artificial Intelligence Development in the Cloud
by: Anderlini, Lucio, et al.
Published: (2025) -
AI-Driven Cloud Resource Optimization for Multi-Cluster Environments
by: Punniyamoorthy, Vinoth, et al.
Published: (2025)