| Main Authors: | Li, Zixuan; Wang, Chuanzhen; Sun, Haotian |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.05569 |
Similar Items
Autopoiesis: A Self-Evolving System Paradigm for LLM Serving Under Runtime Dynamics
by: Jiang, Youhe, et al.
Published: (2026)
AI-Driven Health Monitoring of Distributed Computing Architecture: Insights from XGBoost and SHAP
by: Sun, Xiaoxuan, et al.
Published: (2024)
Cloud-native and Distributed Systems for Efficient and Scalable Large Language Models -- A Research Agenda
by: Xu, Minxian, et al.
Published: (2026)
Joint Temporal-Structural Representation Learning for Distributed Fault Discrimination in Microservice Architectures
by: Xue, Yihan, et al.
Published: (2026)
Automatic BLAS Offloading on Unified Memory Architecture: A Study on NVIDIA Grace-Hopper
by: Li, Junjie, et al.
Published: (2024)
CondenseGraph: Communication-Efficient Distributed GNN Training via On-the-Fly Graph Condensation
by: Zhang, Zizhao, et al.
Published: (2026)
On the Performance and Memory Footprint of Distributed Training: An Empirical Study on Transformers
by: Lu, Zhengxian, et al.
Published: (2024)
sVIRGO: A Scalable Virtual Tree Hierarchical Framework for Distributed Systems
by: Huang, Lican
Published: (2026)
TD-Orch: Scalable Load-Balancing for Distributed Systems with Applications to Graph Processing
by: Zhao, Yiwei, et al.
Published: (2025)
Beluga: A CXL-Based Memory Architecture for Scalable and Efficient LLM KVCache Management
by: Yang, Xinjun, et al.
Published: (2025)
DiFache: Efficient and Scalable Caching on Disaggregated Memory using Decentralized Coherence
by: Zhang, Hanze, et al.
Published: (2025)
Pilotfish: Distributed Execution for Scalable Blockchains
by: Kniep, Quentin, et al.
Published: (2024)
Towards Efficient and Scalable Distributed Vector Search with RDMA
by: Zhi, Xiangyu, et al.
Published: (2025)
Scalable Systems and Software Architectures for High-Performance Computing on cloud platforms
by: Ramesh, Risshab Srinivas
Published: (2024)
Scalable Distributed Vector Search via Accuracy Preserving Index Construction
by: Xu, Yuming, et al.
Published: (2025)
Performance Trade-offs of High Order Meshless Approximation on Distributed Memory Systems
by: Vehovar, Jon, et al.
Published: (2025)
HybridTier: an Adaptive and Lightweight CXL-Memory Tiering System
by: Song, Kevin, et al.
Published: (2023)
Distributed Renaming with Subquadratic Bits via Scalable Committee Election
by: Bai, Sirui, et al.
Published: (2026)
BlockAMC: Scalable In-Memory Analog Matrix Computing for Solving Linear Systems
by: Pan, Lunshuai, et al.
Published: (2024)
Triton-distributed: Programming Overlapping Kernels on Distributed AI Systems with the Triton Compiler
by: Zheng, Size, et al.
Published: (2025)
Distributed Resource Selection for Self-Organising Cloud-Edge Systems
by: Renau, Quentin, et al.
Published: (2025)
Kairos: A Scalable Serving System for Physical AI
by: Dai, Yinwei, et al.
Published: (2026)
Memory-aware Adaptive Scheduling of Scientific Workflows on Heterogeneous Architectures
by: Kulagina, Svetlana, et al.
Published: (2025)
Beyond A Single AI Cluster: A Survey of Decentralized LLM Training
by: Dong, Haotian, et al.
Published: (2025)
A Scalable Clustered Architecture for Cyber-Physical Systems
by: Cabral, Bernardo
Published: (2024)
exa-AMD: A Scalable Workflow for Accelerating AI-Assisted Materials Discovery and Design
by: Moraru, Maxim, et al.
Published: (2025)
Distributed Log-driven Anomaly Detection System based on Evolving Decision Making
by: Tan, Zhuoran, et al.
Published: (2025)
DEX: Scalable Range Indexing on Disaggregated Memory [Extended Version]
by: Lu, Baotong, et al.
Published: (2024)
A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs
by: Zhang, Chen, et al.
Published: (2026)
Enabling Scientific Workflow Scheduling Research in Non-Uniform Memory Access Architectures
by: Vivas, Aurelio, et al.
Published: (2025)
Optimizing Memory Allocation in Distributed Clusters with Predictive Modeling
by: Bader, Jonathan, et al.
Published: (2026)
Computation-Bandwidth-Memory Trade-offs: A Unified Paradigm for AI Infrastructure
by: Fan, Yuankai, et al.
Published: (2025)
Solutions for Distributed Memory Access Mechanism on HPC Clusters
by: Meizner, Jan, et al.
Published: (2025)
Fork is All You Need in Heterogeneous Systems
by: Wang, Zixuan, et al.
Published: (2024)
Akita: A High Usability Simulation Framework for Computer Architecture
by: Jannat, Sabila Al, et al.
Published: (2026)
Daedalus: Self-Adaptive Horizontal Autoscaling for Resource Efficiency of Distributed Stream Processing Systems
by: Pfister, Benjamin J. J., et al.
Published: (2024)
FLARE: A Dataflow-Aware and Scalable Hardware Architecture for Neural-Hybrid Scientific Lossy Compression
by: Jia, Wenqi, et al.
Published: (2025)
RCOMPSs: A Scalable Runtime System for R Code Execution on Manycore Systems
by: Zhang, Xiran, et al.
Published: (2025)
DistFlow: A Fully Distributed RL Framework for Scalable and Efficient LLM Post-Training
by: Wang, Zhixin, et al.
Published: (2025)
SDSL-Solver: Scalable Distributed Sparse Linear Solvers for Large-Scale Interior Point Methods
by: Yang, Shaofeng, et al.
Published: (2026)