Saved in:
| Main Authors: | Liu, Yueji, Jin, Jun, Shu, Wenhui, Li, Shiyong, He, Yongzhan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.01460 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Bring-Your-Own-Model Approach for ML-Driven Storage Placement in Warehouse-Scale Computers
by: Yang, Chenxi, et al.
Published: (2025)
by: Yang, Chenxi, et al.
Published: (2025)
SmartPQ: An Adaptive Concurrent Priority Queue for NUMA Architectures
by: Giannoula, Christina, et al.
Published: (2024)
by: Giannoula, Christina, et al.
Published: (2024)
Blockchain-aided wireless federated learning: Resource allocation and client scheduling
by: Li, Jun, et al.
Published: (2024)
by: Li, Jun, et al.
Published: (2024)
Optimising Virtual Resource Mapping in Multi-Level NUMA Disaggregated Systems
by: Lakew, Ewnetu Bayuh, et al.
Published: (2025)
by: Lakew, Ewnetu Bayuh, et al.
Published: (2025)
ESS: An Offload-Centric Latent-Cache Management Architecture for DeepSeek-V3.2-Exp
by: Chen, Xinhang, et al.
Published: (2025)
by: Chen, Xinhang, et al.
Published: (2025)
Overview and Prospects of Using Integer Surrogate Keys for Data Warehouse Performance Optimization
by: Stumpf, Sviatoslav, et al.
Published: (2025)
by: Stumpf, Sviatoslav, et al.
Published: (2025)
Distributed On-Device LLM Inference With Over-the-Air Computation
by: Zhang, Kai, et al.
Published: (2025)
by: Zhang, Kai, et al.
Published: (2025)
Carbon: Scaling Trusted Payments with Untrusted Machines
by: Camaioni, Martina, et al.
Published: (2022)
by: Camaioni, Martina, et al.
Published: (2022)
CCRSat: A Collaborative Computation Reuse Framework for Satellite Edge Computing Networks
by: Zhang, Ye, et al.
Published: (2025)
by: Zhang, Ye, et al.
Published: (2025)
Exploring Uncore Frequency Scaling for Heterogeneous Computing
by: Zheng, Zhong, et al.
Published: (2025)
by: Zheng, Zhong, et al.
Published: (2025)
KUBEDIRECT: Unleashing the Full Power of the Cluster Manager for Serverless Computing
by: Qi, Sheng, et al.
Published: (2026)
by: Qi, Sheng, et al.
Published: (2026)
DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving
by: Zhong, Yinmin, et al.
Published: (2024)
by: Zhong, Yinmin, et al.
Published: (2024)
EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution
by: Bourgeois, Daniel, et al.
Published: (2024)
by: Bourgeois, Daniel, et al.
Published: (2024)
Scalable Analysis of Urban Scaling Laws: Leveraging Cloud Computing to Analyze 21,280 Global Cities
by: Li, Zhenhui, et al.
Published: (2024)
by: Li, Zhenhui, et al.
Published: (2024)
Large-Scale Metric Computation in Online Controlled Experiment Platform
by: Xiong, Tao, et al.
Published: (2024)
by: Xiong, Tao, et al.
Published: (2024)
MPI-Q: A Message Communication Library for Large-Scale Classical-Quantum Heterogeneous Hybrid Distributed Computing
by: Wang, Feng, et al.
Published: (2026)
by: Wang, Feng, et al.
Published: (2026)
WaterWise: Co-optimizing Carbon- and Water-Footprint Toward Environmentally Sustainable Cloud Computing
by: Jiang, Yankai, et al.
Published: (2025)
by: Jiang, Yankai, et al.
Published: (2025)
EdgeFaaS: A Function-based Framework for Edge Computing
by: Jin, Runyu, et al.
Published: (2022)
by: Jin, Runyu, et al.
Published: (2022)
Driving Computational Efficiency in Large-Scale Platforms using HPC Technologies
by: Mendez, Alexander Martinez, et al.
Published: (2026)
by: Mendez, Alexander Martinez, et al.
Published: (2026)
Scale: Deep Reinforcement Learning for Container Scheduling in Serverless Edge Computing
by: Chen, Chen, et al.
Published: (2026)
by: Chen, Chen, et al.
Published: (2026)
FlashFuser: Expanding the Scale of Kernel Fusion for Compute-Intensive Operators via Inter-Core Connection
by: Huang, Ziyu, et al.
Published: (2025)
by: Huang, Ziyu, et al.
Published: (2025)
Approximated Coded Computing: Towards Fast, Private and Secure Distributed Machine Learning
by: Qiu, Houming, et al.
Published: (2024)
by: Qiu, Houming, et al.
Published: (2024)
Efficient calculation of available space for multi-NUMA virtual machines
by: Gudkov, Andrei, et al.
Published: (2026)
by: Gudkov, Andrei, et al.
Published: (2026)
WindGP: Efficient Graph Partitioning on Heterogenous Machines
by: Zeng, Li, et al.
Published: (2024)
by: Zeng, Li, et al.
Published: (2024)
Profiling and optimization of multi-card GPU machine learning jobs
by: Lawenda, Marcin, et al.
Published: (2025)
by: Lawenda, Marcin, et al.
Published: (2025)
Low-Latency Federated Fine-Tuning for Large Language Models Over Wireless Networks
by: Pang, Zhiwen, et al.
Published: (2026)
by: Pang, Zhiwen, et al.
Published: (2026)
PipeBoost: Resilient Pipelined Architecture for Fast Serverless LLM Scaling
by: Liu, Chongpeng, et al.
Published: (2025)
by: Liu, Chongpeng, et al.
Published: (2025)
Astra: Efficient and Money-saving Automatic Parallel Strategies Search on Heterogeneous GPUs
by: Wang, Peiran, et al.
Published: (2025)
by: Wang, Peiran, et al.
Published: (2025)
Polar: Agentic RL on Any Harness at Scale
by: Xu, Binfeng, et al.
Published: (2026)
by: Xu, Binfeng, et al.
Published: (2026)
HGraphScale: Hierarchical Graph Learning for Autoscaling Microservice Applications in Container-based Cloud Computing
by: Fang, Zhengxin, et al.
Published: (2025)
by: Fang, Zhengxin, et al.
Published: (2025)
Incremental GNN Embedding Computation on Streaming Graphs
by: Wang, Qiange, et al.
Published: (2026)
by: Wang, Qiange, et al.
Published: (2026)
A Distributed Approach for Persistent Homology Computation on a Large Scale
by: Ceccaroni, Riccardo, et al.
Published: (2024)
by: Ceccaroni, Riccardo, et al.
Published: (2024)
Akita: A High Usability Simulation Framework for Computer Architecture
by: Jannat, Sabila Al, et al.
Published: (2026)
by: Jannat, Sabila Al, et al.
Published: (2026)
TileLink: Generating Efficient Compute-Communication Overlapping Kernels using Tile-Centric Primitives
by: Zheng, Size, et al.
Published: (2025)
by: Zheng, Size, et al.
Published: (2025)
Building State Machine Replication Using Practical Network Synchrony
by: Wan, Yiliang, et al.
Published: (2025)
by: Wan, Yiliang, et al.
Published: (2025)
Advancing Anomaly Detection in Computational Workflows with Active Learning
by: Raghavan, Krishnan, et al.
Published: (2024)
by: Raghavan, Krishnan, et al.
Published: (2024)
Scaling LLM Inference Beyond Amdahl`s Limits via Eliminating Non-Scalable Overheads
by: Zhao, Alan, et al.
Published: (2026)
by: Zhao, Alan, et al.
Published: (2026)
Mean field optimal Core Allocation across Malleable jobs
by: Li, Zhouzi, et al.
Published: (2026)
by: Li, Zhouzi, et al.
Published: (2026)
DeepServe: Serverless Large Language Model Serving at Scale
by: Hu, Junhao, et al.
Published: (2025)
by: Hu, Junhao, et al.
Published: (2025)
EvoSort: A Genetic-Algorithm-Based Adaptive Parallel Sorting Framework for Large-Scale High Performance Computing
by: Raj, Shashank, et al.
Published: (2025)
by: Raj, Shashank, et al.
Published: (2025)
Similar Items
-
A Bring-Your-Own-Model Approach for ML-Driven Storage Placement in Warehouse-Scale Computers
by: Yang, Chenxi, et al.
Published: (2025) -
SmartPQ: An Adaptive Concurrent Priority Queue for NUMA Architectures
by: Giannoula, Christina, et al.
Published: (2024) -
Blockchain-aided wireless federated learning: Resource allocation and client scheduling
by: Li, Jun, et al.
Published: (2024) -
Optimising Virtual Resource Mapping in Multi-Level NUMA Disaggregated Systems
by: Lakew, Ewnetu Bayuh, et al.
Published: (2025) -
ESS: An Offload-Centric Latent-Cache Management Architecture for DeepSeek-V3.2-Exp
by: Chen, Xinhang, et al.
Published: (2025)