Uloženo v:
Podrobná bibliografie
Hlavní autor: Singh, Ramkinker
Médium: Recurso digital
Jazyk:angličtina
Vydáno: Zenodo 2025
Témata:
On-line přístup:https://doi.org/10.5281/zenodo.17283268
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Obsah:
  • <p>The rise of complexity and scale in machine learning (ML) workflows and increasing adoption of heterogeneous cloud infrastructures has made cost-effective scheduling of pipelines challenging. Traditional scheduling mechanisms often don't account for variabilities in pricing, efficiency in energy consumption, heterogeneity in resources, or interoperability across clouds, which results in suboptimal costs and inefficiencies in resource utilization. In this paper, we will review the current literature and newer methods that focus on cost-aware scheduling of ML pipelines, in the aforementioned environments. We will focus on intelligent scheduling mechanisms based on reinforcement learning, AI-based scheduling, and optimization, energy aware scheduling policies, and global orchestration. In particular, we will review the recent advances in the use of evolutionary algorithms in scheduling, cloud agnostic scheduling frameworks, and carbon aware scheduling and infrastructure management, to provide a large perspective on how heterogeneous computing environments can be harnessed to increase the performance and cost-effectiveness in ML workflows. We will aim to provide a state-of-the-art overview of methods and approaches that help researchers and practitioners optimize deployment strategies for large-scale ML in multi-cloud and hybrid architecture like exist today.</p>