Saved in:
| Main Authors: | Ogunsina, Kolawole E., Ogunsina, Morayo A. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.03553 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Verifying the Hashgraph Consensus Algorithm
by: Crary, Karl
Published: (2021)
by: Crary, Karl
Published: (2021)
Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems
by: Lu, Ning, et al.
Published: (2024)
by: Lu, Ning, et al.
Published: (2024)
PRAGMA: A Profiling-Reasoned Multi-Agent Framework for Automatic Kernel Optimization
by: Lei, Kelun, et al.
Published: (2025)
by: Lei, Kelun, et al.
Published: (2025)
HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads
by: Agyemang, Justice Owusu, et al.
Published: (2026)
by: Agyemang, Justice Owusu, et al.
Published: (2026)
LogAct: Enabling Agentic Reliability via Shared Logs
by: Balakrishnan, Mahesh, et al.
Published: (2026)
by: Balakrishnan, Mahesh, et al.
Published: (2026)
A biologically Inspired Trust Model for Open Multi-Agent Systems that is Resilient to Rapid Performance Fluctuations
by: Lygizou, Zoi, et al.
Published: (2025)
by: Lygizou, Zoi, et al.
Published: (2025)
ParaCodex: A Profiling-Guided Autonomous Coding Agent for Reliable Parallel Code Generation and Translation
by: Kaplan, Erel, et al.
Published: (2026)
by: Kaplan, Erel, et al.
Published: (2026)
Autonomous Systems Dependability in the era of AI: Design Challenges in Safety, Security, Reliability and Certification
by: Ranjbar, Behnaz, et al.
Published: (2026)
by: Ranjbar, Behnaz, et al.
Published: (2026)
Adaptive Fault Tolerance Mechanisms of Large Language Models in Cloud Computing Environments
by: Jin, Yihong, et al.
Published: (2025)
by: Jin, Yihong, et al.
Published: (2025)
Efficient Multi-Model Orchestration for Self-Hosted Large Language Models
by: Vangala, Bhanu Prakash, et al.
Published: (2025)
by: Vangala, Bhanu Prakash, et al.
Published: (2025)
Towards Multi-Model LLM Schedulers: Empirical Insights into Offloading and Preemption
by: Yildiz, Mert, et al.
Published: (2026)
by: Yildiz, Mert, et al.
Published: (2026)
Glia: A Human-Inspired AI for Automated Systems Design and Optimization
by: Hamadanian, Pouya, et al.
Published: (2025)
by: Hamadanian, Pouya, et al.
Published: (2025)
Characterizing Performance-Energy Trade-offs of Large Language Models in Multi-Request Workflows
by: Ifath, Md. Monzurul Amin, et al.
Published: (2026)
by: Ifath, Md. Monzurul Amin, et al.
Published: (2026)
TimelyFreeze: Adaptive Parameter Freezing Mechanism for Pipeline Parallelism
by: Cho, Seonghye, et al.
Published: (2026)
by: Cho, Seonghye, et al.
Published: (2026)
Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling
by: Jadhav, Prachi, et al.
Published: (2025)
by: Jadhav, Prachi, et al.
Published: (2025)
KORAL: Knowledge Graph Guided LLM Reasoning for SSD Operational Analysis
by: Akewar, Mayur, et al.
Published: (2026)
by: Akewar, Mayur, et al.
Published: (2026)
Cooperative Cognitive Dynamic System in UAV Swarms: Reconfigurable Mechanism and Framework
by: Jia, Ziye, et al.
Published: (2024)
by: Jia, Ziye, et al.
Published: (2024)
Zipage: Maintain High Request Concurrency for LLM Reasoning through Compressed PagedAttention
by: Liao, Mengqi, et al.
Published: (2026)
by: Liao, Mengqi, et al.
Published: (2026)
Adaptive Consensus Gradients Aggregation for Scaled Distributed Training
by: Choukroun, Yoni, et al.
Published: (2024)
by: Choukroun, Yoni, et al.
Published: (2024)
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
by: Singh, Jaskirat, et al.
Published: (2024)
by: Singh, Jaskirat, et al.
Published: (2024)
DataCenterGym: A Physics-Grounded Simulator for Multi-Objective Data Center Scheduling
by: Pathak, Nilavra, et al.
Published: (2026)
by: Pathak, Nilavra, et al.
Published: (2026)
Byzantine-Tolerant Consensus in GPU-Inspired Shared Memory
by: Georgiou, Chryssis, et al.
Published: (2025)
by: Georgiou, Chryssis, et al.
Published: (2025)
Byzantine Fault-Tolerant Multi-Agent System for Healthcare: A Gossip Protocol Approach to Secure Medical Message Propagation
by: Chadderwala, Nihir
Published: (2025)
by: Chadderwala, Nihir
Published: (2025)
AI-Driven Cloud Resource Optimization for Multi-Cluster Environments
by: Punniyamoorthy, Vinoth, et al.
Published: (2025)
by: Punniyamoorthy, Vinoth, et al.
Published: (2025)
MARLaaS: Multi-Tenant Asynchronous Reinforcement Learning as a Service
by: Yu, Timothy Tin Long, et al.
Published: (2026)
by: Yu, Timothy Tin Long, et al.
Published: (2026)
Delay-Aware Multi-Stage Edge Server Upgrade with Budget Constraint
by: Wihidayat, Endar Suprih, et al.
Published: (2025)
by: Wihidayat, Endar Suprih, et al.
Published: (2025)
StreamWise: Serving Multi-Modal Generation in Real-Time at Scale
by: Qiu, Haoran, et al.
Published: (2026)
by: Qiu, Haoran, et al.
Published: (2026)
The Cognitive Penalty: Ablating System 1 and System 2 Reasoning in Edge-Native SLMs for Decentralized Consensus
by: Rizvi, Syed Muhammad Aqdas
Published: (2026)
by: Rizvi, Syed Muhammad Aqdas
Published: (2026)
Deploying Foundation Model Powered Agent Services: A Survey
by: Xu, Wenchao, et al.
Published: (2024)
by: Xu, Wenchao, et al.
Published: (2024)
Reducing Fragmentation and Starvation in GPU Clusters through Dynamic Multi-Objective Scheduling
by: Mamirov, Akhmadillo
Published: (2025)
by: Mamirov, Akhmadillo
Published: (2025)
HFX: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling
by: Yousefijamarani, Zahra, et al.
Published: (2025)
by: Yousefijamarani, Zahra, et al.
Published: (2025)
Electricity Cost Minimization for Multi-Workflow Allocation in Geo-Distributed Data Centers
by: Wang, Shuang, et al.
Published: (2025)
by: Wang, Shuang, et al.
Published: (2025)
Xe-Forge: Multi-Stage LLM-Powered Kernel Optimization for Intel GPU
by: Spoczynski, Marcin, et al.
Published: (2026)
by: Spoczynski, Marcin, et al.
Published: (2026)
CoRaiS: Lightweight Real-Time Scheduler for Multi-Edge Cooperative Computing
by: Hu, Yujiao, et al.
Published: (2024)
by: Hu, Yujiao, et al.
Published: (2024)
Helix Parallelism: Rethinking Sharding Strategies for Interactive Multi-Million-Token LLM Decoding
by: Bhatia, Nidhi, et al.
Published: (2025)
by: Bhatia, Nidhi, et al.
Published: (2025)
FairKV: Balancing Per-Head KV Cache for Fast Multi-GPU Inference
by: Zhao, Bingzhe, et al.
Published: (2025)
by: Zhao, Bingzhe, et al.
Published: (2025)
Multi-IaC-Eval: Benchmarking Cloud Infrastructure as Code Across Multiple Formats
by: Davidson, Sam, et al.
Published: (2025)
by: Davidson, Sam, et al.
Published: (2025)
ARYA: A Physics-Constrained Composable & Deterministic World Model Architecture
by: Dobrin, Seth, et al.
Published: (2026)
by: Dobrin, Seth, et al.
Published: (2026)
Optimizing Data Distribution and Kernel Performance for Efficient Training of Chemistry Foundation Models: A Case Study with MACE
by: Firoz, Jesun, et al.
Published: (2025)
by: Firoz, Jesun, et al.
Published: (2025)
Joint Resource Optimization, Computation Offloading and Resource Slicing for Multi-Edge Traffic-Cognitive Networks
by: Xiaoyang, Ting, et al.
Published: (2024)
by: Xiaoyang, Ting, et al.
Published: (2024)
Similar Items
-
Verifying the Hashgraph Consensus Algorithm
by: Crary, Karl
Published: (2021) -
Training Overhead Ratio: A Practical Reliability Metric for Large Language Model Training Systems
by: Lu, Ning, et al.
Published: (2024) -
PRAGMA: A Profiling-Reasoned Multi-Agent Framework for Automatic Kernel Optimization
by: Lei, Kelun, et al.
Published: (2025) -
HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads
by: Agyemang, Justice Owusu, et al.
Published: (2026) -
LogAct: Enabling Agentic Reliability via Shared Logs
by: Balakrishnan, Mahesh, et al.
Published: (2026)