:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Zixuan, Wang, Chuanzhen, Sun, Haotian
Format:	Preprint
Published:	2026
Subjects:	Distributed, Parallel, and Cluster Computing
Online Access:	https://arxiv.org/abs/2601.05569
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics
by: Jiang, Youhe, et al.
Published: (2026)

AI-Driven Health Monitoring of Distributed Computing Architecture: Insights from XGBoost and SHAP
by: Sun, Xiaoxuan, et al.
Published: (2024)

Cloud-native and Distributed Systems for Efficient and Scalable Large Language Models -- A Research Agenda
by: Xu, Minxian, et al.
Published: (2026)

Joint Temporal-Structural Representation Learning for Distributed Fault Discrimination in Microservice Architectures
by: Xue, Yihan, et al.
Published: (2026)

Automatic BLAS Offloading on Unified Memory Architecture: A Study on NVIDIA Grace-Hopper
by: Li, Junjie, et al.
Published: (2024)

CondenseGraph: Communication-Efficient Distributed GNN Training via On-the-Fly Graph Condensation
by: Zhang, Zizhao, et al.
Published: (2026)

On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers
by: Lu, Zhengxian, et al.
Published: (2024)

sVIRGO: A Scalable Virtual Tree Hierarchical Framework for Distributed Systems
by: Huang, Lican
Published: (2026)

TD-Orch: Scalable Load-Balancing for Distributed Systems with Applications to Graph Processing
by: Zhao, Yiwei, et al.
Published: (2025)

Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management
by: Yang, Xinjun, et al.
Published: (2025)

DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence
by: Zhang, Hanze, et al.
Published: (2025)

Pilotfish: Distributed Execution for Scalable Blockchains
by: Kniep, Quentin, et al.
Published: (2024)

Towards Efficient and Scalable Distributed Vector Search with RDMA
by: Zhi, Xiangyu, et al.
Published: (2025)

Scalable Systems and Software Architectures for High-Performance Computing on cloud platforms
by: Ramesh, Risshab Srinivas
Published: (2024)

Scalable Distributed Vector Search via Accuracy Preserving Index Construction
by: Xu, Yuming, et al.
Published: (2025)

Performance Trade-offs of High Order Meshless Approximation on Distributed Memory Systems
by: Vehovar, Jon, et al.
Published: (2025)

HybridTier: an Adaptive and Lightweight CXL-Memory Tiering System
by: Song, Kevin, et al.
Published: (2023)

Distributed Renaming with Subquadratic Bits via Scalable Committee Election
by: Bai, Sirui, et al.
Published: (2026)

BlockAMC: Scalable In-Memory Analog Matrix Computing for Solving Linear Systems
by: Pan, Lunshuai, et al.
Published: (2024)

Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler
by: Zheng, Size, et al.
Published: (2025)

Distributed Resource Selection for Self-Organising Cloud-Edge Systems
by: Renau, Quentin, et al.
Published: (2025)

Kairos: A Scalable Serving System for Physical AI
by: Dai, Yinwei, et al.
Published: (2026)

Memory-aware Adaptive Scheduling of Scientific Workflows on Heterogeneous Architectures
by: Kulagina, Svetlana, et al.
Published: (2025)

Beyond A Single AI Cluster: A Survey of Decentralized LLM Training
by: Dong, Haotian, et al.
Published: (2025)

A Scalable Clustered Architecture for Cyber-Physical Systems
by: Cabral, Bernardo
Published: (2024)

exa-AMD: A Scalable Workflow for Accelerating AI-Assisted Materials Discovery and Design
by: Moraru, Maxim, et al.
Published: (2025)

Distributed Log-driven Anomaly Detection System based on Evolving Decision Making
by: Tan, Zhuoran, et al.
Published: (2025)

DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version]
by: Lu, Baotong, et al.
Published: (2024)

A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs
by: Zhang, Chen, et al.
Published: (2026)

Enabling Scientific Workflow Scheduling Research in Non-Uniform Memory Access Architectures
by: Vivas, Aurelio, et al.
Published: (2025)

Optimizing Memory Allocation in Distributed Clusters with Predictive Modeling
by: Bader, Jonathan, et al.
Published: (2026)

Computation-Bandwidth-Memory Trade-offs: A Unified Paradigm for AI Infrastructure
by: Fan, Yuankai, et al.
Published: (2025)

Solutions for Distributed Memory Access Mechanism on HPC Clusters
by: Meizner, Jan, et al.
Published: (2025)

Fork is All You Need in Heterogeneous Systems
by: Wang, Zixuan, et al.
Published: (2024)

Akita: A High Usability Simulation Framework for Computer Architecture
by: Jannat, Sabila Al, et al.
Published: (2026)

Daedalus: Self-Adaptive Horizontal Autoscaling for Resource Efficiency of Distributed Stream Processing Systems
by: Pfister, Benjamin J. J., et al.
Published: (2024)

FLARE: A Dataflow-Aware and Scalable Hardware Architecture for Neural-Hybrid Scientific Lossy Compression
by: Jia, Wenqi, et al.
Published: (2025)

RCOMPSs: A Scalable Runtime System for R Code Execution on Manycore Systems
by: Zhang, Xiran, et al.
Published: (2025)

DistFlow: A Fully Distributed RL Framework for Scalable and Efficient LLM Post-Training
by: Wang, Zhixin, et al.
Published: (2025)

SDSL-Solver: Scalable Distributed Sparse Linear Solvers for Large-Scale Interior Point Methods
by: Yang, Shaofeng, et al.
Published: (2026)