:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ogunsina, Kolawole E., Ogunsina, Morayo A.
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Distributed, Parallel, and Cluster Computing
Online Access:	https://arxiv.org/abs/2505.03553
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Verifying the Hashgraph Consensus Algorithm
by: Crary, Karl
Published: (2021)

Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems
by: Lu, Ning, et al.
Published: (2024)

PRAGMA: A Profiling-Reasoned Multi-Agent Framework for Automatic Kernel Optimization
by: Lei, Kelun, et al.
Published: (2025)

HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads
by: Agyemang, Justice Owusu, et al.
Published: (2026)

LogAct: Enabling Agentic Reliability via Shared Logs
by: Balakrishnan, Mahesh, et al.
Published: (2026)

A biologically Inspired Trust Model for Open Multi-Agent Systems that is Resilient to Rapid Performance Fluctuations
by: Lygizou, Zoi, et al.
Published: (2025)

ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation
by: Kaplan, Erel, et al.
Published: (2026)

Autonomous Systems Dependability in the era of AI: Design Challenges in Safety, Security, Reliability and Certification
by: Ranjbar, Behnaz, et al.
Published: (2026)

Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments
by: Jin, Yihong, et al.
Published: (2025)

Efficient Multi-Model Orchestration for Self-Hosted Large Language Models
by: Vangala, Bhanu Prakash, et al.
Published: (2025)

Towards Multi-Model LLM Schedulers: Empirical Insights into Offloading and Preemption
by: Yildiz, Mert, et al.
Published: (2026)

Glia: A Human-Inspired AI for Automated Systems Design and Optimization
by: Hamadanian, Pouya, et al.
Published: (2025)

Characterizing Performance-Energy Trade-offs of Large Language Models in Multi-Request Workflows
by: Ifath, Md. Monzurul Amin, et al.
Published: (2026)

TimelyFreeze: Adaptive Parameter Freezing Mechanism for Pipeline Parallelism
by: Cho, Seonghye, et al.
Published: (2026)

Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling
by: Jadhav, Prachi, et al.
Published: (2025)

KORAL: Knowledge Graph Guided LLM Reasoning for SSD Operational Analysis
by: Akewar, Mayur, et al.
Published: (2026)

Cooperative Cognitive Dynamic System in UAV Swarms: Reconfigurable Mechanism and Framework
by: Jia, Ziye, et al.
Published: (2024)

Zipage: Maintain High Request Concurrency for LLM Reasoning through Compressed PagedAttention
by: Liao, Mengqi, et al.
Published: (2026)

Adaptive Consensus Gradients Aggregation for Scaled Distributed Training
by: Choukroun, Yoni, et al.
Published: (2024)

On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
by: Singh, Jaskirat, et al.
Published: (2024)

DataCenterGym: A Physics-Grounded Simulator for Multi-Objective Data Center Scheduling
by: Pathak, Nilavra, et al.
Published: (2026)

Byzantine-Tolerant Consensus in GPU-Inspired Shared Memory
by: Georgiou, Chryssis, et al.
Published: (2025)

Byzantine Fault-Tolerant Multi-Agent System for Healthcare: A Gossip Protocol Approach to Secure Medical Message Propagation
by: Chadderwala, Nihir
Published: (2025)

AI-Driven Cloud Resource Optimization for Multi-Cluster Environments
by: Punniyamoorthy, Vinoth, et al.
Published: (2025)

MARLaaS: Multi-Tenant Asynchronous Reinforcement Learning as a Service
by: Yu, Timothy Tin Long, et al.
Published: (2026)

Delay-Aware Multi-Stage Edge Server Upgrade with Budget Constraint
by: Wihidayat, Endar Suprih, et al.
Published: (2025)

StreamWise: Serving Multi-Modal Generation in Real-Time at Scale
by: Qiu, Haoran, et al.
Published: (2026)

The Cognitive Penalty: Ablating System 1 and System 2 Reasoning in Edge-Native SLMs for Decentralized Consensus
by: Rizvi, Syed Muhammad Aqdas
Published: (2026)

Deploying Foundation Model Powered Agent Services: A Survey
by: Xu, Wenchao, et al.
Published: (2024)

Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling
by: Mamirov, Akhmadillo
Published: (2025)

HFX: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling
by: Yousefijamarani, Zahra, et al.
Published: (2025)

Electricity Cost Minimization for Multi-Workflow Allocation in Geo-Distributed Data Centers
by: Wang, Shuang, et al.
Published: (2025)

Xe-Forge: Multi-Stage LLM-Powered Kernel Optimization for Intel GPU
by: Spoczynski, Marcin, et al.
Published: (2026)

CoRaiS: Lightweight Real-Time Scheduler for Multi-Edge Cooperative Computing
by: Hu, Yujiao, et al.
Published: (2024)

Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding
by: Bhatia, Nidhi, et al.
Published: (2025)

FairKV: Balancing Per-Head KV Cache for Fast Multi-GPU Inference
by: Zhao, Bingzhe, et al.
Published: (2025)

Multi-IaC-Eval: Benchmarking Cloud Infrastructure as Code Across Multiple Formats
by: Davidson, Sam, et al.
Published: (2025)

ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture
by: Dobrin, Seth, et al.
Published: (2026)

Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE
by: Firoz, Jesun, et al.
Published: (2025)

Joint Resource Optimization, Computation Offloading and Resource Slicing for Multi-Edge Traffic-Cognitive Networks
by: Xiaoyang, Ting, et al.
Published: (2024)