| Main Authors: | Yu, Timothy Tin Long, Singh, Gursimran, Shi, Ge, Sadri, Hanieh, Zhang, Yong, Fan, Zhenan |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.08527 |
Similar Items
ElasticMoE: An Efficient Auto Scaling Method for Mixture-of-Experts Models
by: Singh, Gursimran, et al.
Published: (2025)
ExpertWeave: Efficiently Serving Expert-Specialized Fine-Tuned Adapters at Scale
by: Shi, Ge, et al.
Published: (2025)
A Reinforcement Learning-Driven Task Scheduling Algorithm for Multi-Tenant Distributed Systems
by: Zhang, Xiaopei, et al.
Published: (2025)
Efficiently Serving Large Multimodal Models Using EPD Disaggregation
by: Singh, Gursimran, et al.
Published: (2024)
Scheduling Deep Learning Jobs in Multi-Tenant GPU Clusters via Wise Resource Sharing
by: Luo, Yizhou, et al.
Published: (2024)
Caching Aided Multi-Tenant Serverless Computing
by: Qiao, Chu, et al.
Published: (2024)
Guardian: Safe GPU Sharing in Multi-Tenant Environments
by: Pavlidakis, Manos, et al.
Published: (2024)
The National Research Platform: Stretched, Multi-Tenant, Scientific Kubernetes Cluster
by: Weitzel, Derek, et al.
Published: (2025)
Trabant: A Serverless Architecture for Multi-Tenant Orbital Edge Computing
by: Pfandzelter, Tobias, et al.
Published: (2025)
Boosting Asynchronous Decentralized Learning with Model Fragmentation
by: Biswas, Sayan, et al.
Published: (2024)
HFX: Joint Design of Algorithms and Systems for Multi-SLO Serving and Fast Scaling
by: Yousefijamarani, Zahra, et al.
Published: (2025)
EdgeLoRA: An Efficient Multi-Tenant LLM Serving System on Edge Devices
by: Shen, Zheyu, et al.
Published: (2025)
Incentivizing Multi-Tenant Split Federated Learning for Foundation Models at the Network Edge
by: Li, Songyuan, et al.
Published: (2025)
Multi Armed Bandit Algorithms Based Virtual Machine Allocation Policy for Security in Multi-Tenant Distributed Systems
by: Patil, Pravin, et al.
Published: (2024)
Collaborative Processing for Multi-Tenant Inference on Memory-Constrained Edge TPUs
by: Ng, Nathan, et al.
Published: (2026)
Delta Fair Sharing: Performance Isolation for Multi-Tenant Storage Systems
by: Griggs, Tyler, et al.
Published: (2026)
MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing
by: Xue, Chunyu, et al.
Published: (2026)
Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale
by: Zhao, Kaiyang, et al.
Published: (2026)
MUSE: Multi-Tenant Model Serving With Seamless Model Updates
by: Correia, Cláudio, et al.
Published: (2026)
Efficient Asynchronous Federated Learning with Sparsification and Quantization
by: Jia, Juncheng, et al.
Published: (2023)
Deep Reinforcement Learning for Job Scheduling and Resource Management in Cloud Computing: An Algorithm-Level Review
by: Gu, Yan, et al.
Published: (2025)
Asynchronous Personalized Federated Learning through Global Memorization
by: Wan, Fan, et al.
Published: (2025)
Deep Reinforcement Learning based Online Scheduling Policy for Deep Neural Network Multi-Tenant Multi-Accelerator Systems
by: Blanco, Francesco G., et al.
Published: (2024)
Towards Fair and Firm Real-Time Scheduling in DNN Multi-Tenant Multi-Accelerator Systems via Reinforcement Learning
by: Russo, Enrico, et al.
Published: (2024)
Asynchronous Secure Federated Learning with Byzantine aggregators
by: Del Pozzo, Antonella, et al.
Published: (2026)
Fairness-Aware Job Scheduling for Multi-Job Federated Learning
by: Shi, Yuxin, et al.
Published: (2024)
NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing
by: Gao, Fei, et al.
Published: (2024)
DORA: A Scalable Asynchronous Reinforcement Learning System for Language Model Training
by: Hu, Tianhao, et al.
Published: (2026)
Mitigating Persistent Client Dropout in Asynchronous Decentralized Federated Learning
by: Stępka, Ignacy, et al.
Published: (2025)
DynTaskMAS: A Dynamic Task Graph-driven Framework for Asynchronous and Parallel LLM-based Multi-Agent Systems
by: Yu, Junwei, et al.
Published: (2025)
THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs
by: Karabulut, Emre, et al.
Published: (2024)
FaaSMoE: A Serverless Framework for Multi-Tenant Mixture-of-Experts Serving
by: Wang, Minghe, et al.
Published: (2026)
EchoPFL: Asynchronous Personalized Federated Learning on Mobile Devices with On-Demand Staleness Control
by: Li, Xiaochen, et al.
Published: (2024)
DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
by: Wang, Taiyi, et al.
Published: (2024)
FedFa: A Fully Asynchronous Training Paradigm for Federated Learning
by: Xu, Haotian, et al.
Published: (2024)
APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs
by: Fan, Jiakun, et al.
Published: (2025)
OpenTinker: Separating Concerns in Agentic Reinforcement Learning
by: Zhu, Siqi, et al.
Published: (2026)
Towards Adaptive Asynchronous Federated Learning for Human Activity Recognition
by: Gajanin, Rastko, et al.
Published: (2024)
Deploying Foundation Model Powered Agent Services: A Survey
by: Xu, Wenchao, et al.
Published: (2024)
Private Virtual Tree Networks for Secure Multi-Tenant Environments Based on the VIRGO Overlay Network
by: Huang, Lican
Published: (2025)