:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yu, Timothy Tin Long, Singh, Gursimran, Shi, Ge, Sadri, Hanieh, Zhang, Yong, Fan, Zhenan
Format:	Preprint
Published:	2026
Subjects:	Distributed, Parallel, and Cluster Computing Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.08527
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ElasticMoE: An Efficient Auto Scaling Method for Mixture-of-Experts Models
by: Singh, Gursimran, et al.
Published: (2025)

ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at Scale
by: Shi, Ge, et al.
Published: (2025)

A Reinforcement Learning-Driven Task Scheduling Algorithm for Multi-Tenant Distributed Systems
by: Zhang, Xiaopei, et al.
Published: (2025)

Efficiently Serving Large Multimodal Models Using EPD Disaggregation
by: Singh, Gursimran, et al.
Published: (2024)

Scheduling Deep Learning Jobs in Multi-Tenant GPU Clusters via Wise Resource Sharing
by: Luo, Yizhou, et al.
Published: (2024)

Caching Aided Multi-Tenant Serverless Computing
by: Qiao, Chu, et al.
Published: (2024)

Guardian: Safe GPU Sharing in Multi-Tenant Environments
by: Pavlidakis, Manos, et al.
Published: (2024)

The National Research Platform: Stretched, Multi-Tenant, Scientific Kubernetes Cluster
by: Weitzel, Derek, et al.
Published: (2025)

Trabant: A Serverless Architecture for Multi-Tenant Orbital Edge Computing
by: Pfandzelter, Tobias, et al.
Published: (2025)

Boosting Asynchronous Decentralized Learning with Model Fragmentation
by: Biswas, Sayan, et al.
Published: (2024)

HFX: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling
by: Yousefijamarani, Zahra, et al.
Published: (2025)

EdgeLoRA: An Efficient Multi-Tenant LLM Serving System on Edge Devices
by: Shen, Zheyu, et al.
Published: (2025)

Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge
by: Li, Songyuan, et al.
Published: (2025)

Multi Armed Bandit Algorithms Based Virtual Machine Allocation Policy for Security in Multi-Tenant Distributed Systems
by: Patil, Pravin, et al.
Published: (2024)

Collaborative Processing for Multi-Tenant Inference on Memory-Constrained Edge TPUs
by: Ng, Nathan, et al.
Published: (2026)

Delta Fair Sharing: Performance Isolation for Multi-Tenant Storage Systems
by: Griggs, Tyler, et al.
Published: (2026)

MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing
by: Xue, Chunyu, et al.
Published: (2026)

Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale
by: Zhao, Kaiyang, et al.
Published: (2026)

MUSE: Multi-Tenant Model Serving With Seamless Model Updates
by: Correia, Cláudio, et al.
Published: (2026)

Efficient Asynchronous Federated Learning with Sparsification and Quantization
by: Jia, Juncheng, et al.
Published: (2023)

Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
by: Gu, Yan, et al.
Published: (2025)

Asynchronous Personalized Federated Learning through Global Memorization
by: Wan, Fan, et al.
Published: (2025)

Deep Reinforcement Learning based Online Scheduling Policy for Deep Neural Network Multi-Tenant Multi-Accelerator Systems
by: Blanco, Francesco G., et al.
Published: (2024)

Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning
by: Russo, Enrico, et al.
Published: (2024)

Asynchronous Secure Federated Learning with Byzantine aggregators
by: Del Pozzo, Antonella, et al.
Published: (2026)

Fairness-Aware Job Scheduling for Multi-Job Federated Learning
by: Shi, Yuxin, et al.
Published: (2024)

NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing
by: Gao, Fei, et al.
Published: (2024)

DORA: A Scalable Asynchronous Reinforcement Learning System for Language Model Training
by: Hu, Tianhao, et al.
Published: (2026)

Mitigating Persistent Client Dropout in Asynchronous Decentralized Federated Learning
by: Stępka, Ignacy, et al.
Published: (2025)

DynTaskMAS: A Dynamic Task Graph-driven Framework for Asynchronous and Parallel LLM-based Multi-Agent Systems
by: Yu, Junwei, et al.
Published: (2025)

THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs
by: Karabulut, Emre, et al.
Published: (2024)

FaaSMoE: A Serverless Framework for Multi-Tenant Mixture-of-Experts Serving
by: Wang, Minghe, et al.
Published: (2026)

EchoPFL: Asynchronous Personalized Federated Learning on Mobile Devices with On-Demand Staleness Control
by: Li, Xiaochen, et al.
Published: (2024)

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
by: Wang, Taiyi, et al.
Published: (2024)

FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
by: Xu, Haotian, et al.
Published: (2024)

APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs
by: Fan, Jiakun, et al.
Published: (2025)

OpenTinker: Separating Concerns in Agentic Reinforcement Learning
by: Zhu, Siqi, et al.
Published: (2026)

Towards Adaptive Asynchronous Federated Learning for Human Activity Recognition
by: Gajanin, Rastko, et al.
Published: (2024)

Deploying Foundation Model Powered Agent Services: A Survey
by: Xu, Wenchao, et al.
Published: (2024)

Private Virtual Tree Networks for Secure Multi-Tenant Environments Based on the VIRGO Overlay Network
by: Huang, Lican
Published: (2025)