Bibliographic Details
Main Authors: Herabad, Mohammadsadeq Garshasbi; Taheri, Javid; Ahmed, Bestoun S.; Curescu, Calin
Format: Preprint
Published: 2026
Subjects: Distributed, Parallel, and Cluster Computing
Online Access: https://arxiv.org/abs/2606.02259
_version_ 1866913179931508736
author Herabad, Mohammadsadeq Garshasbi
Taheri, Javid
Ahmed, Bestoun S.
Curescu, Calin
author_facet Herabad, Mohammadsadeq Garshasbi
Taheri, Javid
Ahmed, Bestoun S.
Curescu, Calin
contents The edge-cloud paradigm improves service delivery by orchestrating resources across edge nodes and cloud data centres. These environments consist of heterogeneous, interconnected computing nodes that cooperate to deliver continuous services. However, their scale and complexity increase vulnerability to failures from hardware malfunctions, software defects, and dynamic operating conditions. These failures can disrupt system configurations and service execution, leading to reduced reliability, performance degradation, and violations of service-level objectives. Ensuring service execution requires adaptive service placement strategies across edge-cloud resources. This study introduces a fault-tolerant service placement approach (Enhanced Evolution Strategy for Collaborative Neural Decision-making, EES-CND) for edge-cloud environments. The method employs collaborative decision-making, wherein multiple lightweight neural networks jointly infer redeployment strategies during failure events. To address the system dynamics and mitigate performance drift, adaptive models are updated online using an enhanced evolution strategy. Extensive simulations show that EES-CND effectively handles performance drift and significantly outperforms existing methods in service recovery time, response time, and reliability, achieving a 44.8% reduction in fault-tolerance cost compared to standalone models.
format Preprint
id arxiv_https___arxiv_org_abs_2606_02259
institution arXiv
publishDate 2026
record_format arxiv
title EES-CND: Collaborative Neural Decision-Making for Drift-Aware Fault-Tolerant Edge-Cloud Service Placement
topic Distributed, Parallel, and Cluster Computing
url https://arxiv.org/abs/2606.02259