Saved in:
| Main Authors: | Wang, Xin, Shen, Hong, Tian, Hui, Tao, Ye |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.22146 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Scheduling Coflows in Multi-Core OCS Networks with Performance Guarantee
by: Wang, Xin, et al.
Published: (2026)
by: Wang, Xin, et al.
Published: (2026)
Multi-Source Coflow Scheduling in Collaborative Edge Computing with Multihop Network
by: Sahni, Yuvraj, et al.
Published: (2024)
by: Sahni, Yuvraj, et al.
Published: (2024)
ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs
by: Hui, Xinning, et al.
Published: (2024)
by: Hui, Xinning, et al.
Published: (2024)
RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs
by: Xie, Xi, et al.
Published: (2024)
by: Xie, Xi, et al.
Published: (2024)
CoEdge-RAG: Optimizing Hierarchical Scheduling for Retrieval-Augmented LLMs in Collaborative Edge Computing
by: Hong, Guihang, et al.
Published: (2025)
by: Hong, Guihang, et al.
Published: (2025)
A Poly-Log Approximation for Transaction Scheduling in Fog-Cloud Computing and Beyond
by: Adhikari, Ramesh, et al.
Published: (2025)
by: Adhikari, Ramesh, et al.
Published: (2025)
A Reinforcement Learning-Driven Task Scheduling Algorithm for Multi-Tenant Distributed Systems
by: Zhang, Xiaopei, et al.
Published: (2025)
by: Zhang, Xiaopei, et al.
Published: (2025)
MemFine: Memory-Aware Fine-Grained Scheduling for MoE Training
by: Zhao, Lu, et al.
Published: (2025)
by: Zhao, Lu, et al.
Published: (2025)
Eva: Cost-Efficient Cloud-Based Cluster Scheduling
by: Chang, Tzu-Tao, et al.
Published: (2025)
by: Chang, Tzu-Tao, et al.
Published: (2025)
Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework
by: Zhang, Boyuan, et al.
Published: (2024)
by: Zhang, Boyuan, et al.
Published: (2024)
Stream-K Optimization and Exploration
by: Rackley, Nick, et al.
Published: (2024)
by: Rackley, Nick, et al.
Published: (2024)
Accelerating Mixed-Precision Out-of-Core Cholesky Factorization with Static Task Scheduling
by: Ren, Jie, et al.
Published: (2024)
by: Ren, Jie, et al.
Published: (2024)
Learning to Schedule: A Supervised Learning Framework for Network-Aware Scheduling of Data-Intensive Workloads
by: Timilsina, Sankalpa, et al.
Published: (2025)
by: Timilsina, Sankalpa, et al.
Published: (2025)
SwitchFS: Asynchronous Metadata Updates for Distributed Filesystems with In-Network Coordination
by: Xu, Jingwei, et al.
Published: (2024)
by: Xu, Jingwei, et al.
Published: (2024)
iDDS: Intelligent Distributed Dispatch and Scheduling for Workflow Orchestration
by: Guan, Wen, et al.
Published: (2025)
by: Guan, Wen, et al.
Published: (2025)
A Real-Time Digital Twin for Adaptive Scheduling
by: Zhang, Yihe, et al.
Published: (2025)
by: Zhang, Yihe, et al.
Published: (2025)
Communication-Efficient Collaborative LLM Inference over LEO Satellite Networks
by: Zhang, Songge, et al.
Published: (2026)
by: Zhang, Songge, et al.
Published: (2026)
Efficient Circuit Cutting and Scheduling in a Multi-Node Quantum System with Dynamic EPR Pairs
by: Du, Zefan, et al.
Published: (2024)
by: Du, Zefan, et al.
Published: (2024)
SoK: Consensus for Fair Message Ordering
by: Li, Zhuolun, et al.
Published: (2024)
by: Li, Zhuolun, et al.
Published: (2024)
SoK: DAG-based Consensus Protocols
by: Raikwar, Mayank, et al.
Published: (2024)
by: Raikwar, Mayank, et al.
Published: (2024)
PWDFT-SW: Extending the Limit of Plane-Wave DFT Calculations to 16K Atoms on the New Sunway Supercomputer
by: Jiang, Qingcai, et al.
Published: (2024)
by: Jiang, Qingcai, et al.
Published: (2024)
LRScheduler: A Layer-aware and Resource-adaptive Container Scheduler in Edge Computing
by: Tang, Zhiqing, et al.
Published: (2025)
by: Tang, Zhiqing, et al.
Published: (2025)
PROSERVE: Unified Multi-Priority Request Scheduling for LLM Serving
by: Huang, Weizhe, et al.
Published: (2025)
by: Huang, Weizhe, et al.
Published: (2025)
SLO-Aware Scheduling for Large Language Model Inferences
by: Huang, Jinqi, et al.
Published: (2025)
by: Huang, Jinqi, et al.
Published: (2025)
Metronome: Efficient Scheduling for Periodic Traffic Jobs with Network and Priority Awareness
by: Jiang, Hao, et al.
Published: (2025)
by: Jiang, Hao, et al.
Published: (2025)
Hyperion: Hierarchical Scheduling for Parallel LLM Acceleration in Multi-tier Networks
by: Ma, Mulei, et al.
Published: (2025)
by: Ma, Mulei, et al.
Published: (2025)
Collaborative Resource Management and Workloads Scheduling in Cloud-Assisted Mobile Edge Computing across Timescales
by: Tang, Lujie, et al.
Published: (2024)
by: Tang, Lujie, et al.
Published: (2024)
EAT: QoS-Aware Edge-Collaborative AIGC Task Scheduling via Attention-Guided Diffusion Reinforcement Learning
by: Xu, Zhifei, et al.
Published: (2025)
by: Xu, Zhifei, et al.
Published: (2025)
Flash-KMeans: Fast and Memory-Efficient Exact K-Means
by: Yang, Shuo, et al.
Published: (2026)
by: Yang, Shuo, et al.
Published: (2026)
DCSim: Computing and Networking Integration based Container Scheduling Simulator for Data Centers
by: Hu, Jinlong, et al.
Published: (2024)
by: Hu, Jinlong, et al.
Published: (2024)
QoE-oriented Dependent Task Scheduling under Multi-dimensional QoS Constraints over Distributed Networks
by: Fan, Xuwei, et al.
Published: (2023)
by: Fan, Xuwei, et al.
Published: (2023)
Deep Reinforcement Learning-based Methods for Resource Scheduling in Cloud Computing: A Review and Future Directions
by: Zhou, Guangyao, et al.
Published: (2021)
by: Zhou, Guangyao, et al.
Published: (2021)
Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks
by: Xu, Changfu, et al.
Published: (2024)
by: Xu, Changfu, et al.
Published: (2024)
Adaptive K-PackCache: Cost-Centric Data Caching in Cloud
by: Sarkar, Suvarthi, et al.
Published: (2025)
by: Sarkar, Suvarthi, et al.
Published: (2025)
Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM
by: Zhang, Yao, et al.
Published: (2025)
by: Zhang, Yao, et al.
Published: (2025)
IsoSched: Preemptive Tile Cascaded Scheduling of Multi-DNN via Subgraph Isomorphism
by: Zhao, Boran, et al.
Published: (2025)
by: Zhao, Boran, et al.
Published: (2025)
Revisiting the Schedule Graph Generation for the Exact and Sustainable Analysis of Non-preemptive Scheduling
by: Vlk, Marek, et al.
Published: (2024)
by: Vlk, Marek, et al.
Published: (2024)
Taming Request Imbalance: SLO-Aware Scheduling for Disaggregated LLM Inference
by: Wang, Qipeng
Published: (2026)
by: Wang, Qipeng
Published: (2026)
Popcorn: Accelerating Kernel K-means on GPUs through Sparse Linear Algebra
by: Bellavita, Julian, et al.
Published: (2025)
by: Bellavita, Julian, et al.
Published: (2025)
CCRSat: A Collaborative Computation Reuse Framework for Satellite Edge Computing Networks
by: Zhang, Ye, et al.
Published: (2025)
by: Zhang, Ye, et al.
Published: (2025)
Similar Items
-
Scheduling Coflows in Multi-Core OCS Networks with Performance Guarantee
by: Wang, Xin, et al.
Published: (2026) -
Multi-Source Coflow Scheduling in Collaborative Edge Computing with Multihop Network
by: Sahni, Yuvraj, et al.
Published: (2024) -
ESG: Pipeline-Conscious Efficient Scheduling of DNN Workflows on Serverless Platforms with Shareable GPUs
by: Hui, Xinning, et al.
Published: (2024) -
RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs
by: Xie, Xi, et al.
Published: (2024) -
CoEdge-RAG: Optimizing Hierarchical Scheduling for Retrieval-Augmented LLMs in Collaborative Edge Computing
by: Hong, Guihang, et al.
Published: (2025)