:: Library Catalog

Bibliographic Details
Main Authors:	Wang, Cong; Zheng, Yusheng
Format:	Preprint
Published:	2026
Subjects:	Operating Systems; Distributed, Parallel, and Cluster Computing
Online Access:	https://arxiv.org/abs/2602.08199

Similar Items

GPUOS: A GPU Operating System Primitive for Transparent Operation Fusion
by: Yang, Yiwei, et al.
Published: (2026)

NCCLbpf: Verified, Composable Policy Execution for GPU Collective Communication
by: Zheng, Yusheng
Published: (2026)

Toward Systems Foundations for Agentic Exploration
by: Xu, Jiakai, et al.
Published: (2025)

PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation
by: Wei, Xingda, et al.
Published: (2024)

Taming Serverless Cold Starts Through OS Co-Design
by: Holmes, Ben, et al.
Published: (2025)

Demystifying Serverless Costs on Public Platforms: Bridging Billing, Architecture, and OS Scheduling
by: Lin, Changyuan, et al.
Published: (2025)

Performance Isolation and Semantic Determinism in Efficient GPU Spatial Sharing
by: Yang, Zhenyuan, et al.
Published: (2026)

MARS: Efficient, Adaptive Co-Scheduling for Heterogeneous Agentic Systems
by: Wang, Yifei, et al.
Published: (2026)

TrEnv-X: Transparently Share Serverless Execution Environments Across Different Functions and Nodes
by: Huang, Jialiang, et al.
Published: (2025)

Formal Definitions and Performance Comparison of Consistency Models for Parallel File Systems
by: Wang, Chen, et al.
Published: (2024)

Agent Centric Operating System -- a Comprehensive Review and Outlook for Operating System
by: Jia, Shian, et al.
Published: (2024)

FALCON: Pinpointing and Mitigating Stragglers for Large-Scale Hybrid-Parallel Training
by: Wu, Tianyuan, et al.
Published: (2024)

"Range as a Key" is the Key! Fast and Compact Cloud Block Store Index with RASK
by: Zhao, Haoru, et al.
Published: (2026)

Nixie: Efficient, Transparent Temporal Multiplexing for Consumer GPUs
by: Xu, Yechen, et al.
Published: (2026)

TIDAL: Recovering Temporal Phase for Cloud Block Storage Placement from LLM-Derived Semantics
by: Tan, Difan, et al.
Published: (2026)

HybridTier: an Adaptive and Lightweight CXL-Memory Tiering System
by: Song, Kevin, et al.
Published: (2023)

BLITZSCALE: Fast and Live Large Model Autoscaling with O(1) Host Caching
by: Zhang, Dingyan, et al.
Published: (2024)

Nanvix: A Multikernel OS Design for High-Density Serverless Deployments
by: Segarra, Carlos, et al.
Published: (2026)

DPC: A Distributed Page Cache over CXL
by: Bergman, Shai, et al.
Published: (2026)

EdgeFlow: Fast Cold Starts for LLMs on Mobile Devices
by: Yan, Yongsheng, et al.
Published: (2026)

A Periodic Space of Distributed Computing: Vision & Framework
by: Salehi, Mohsen Amini, et al.
Published: (2026)

Equilibria: Fair Multi-Tenant CXL Memory Tiering At Scale
by: Zhao, Kaiyang, et al.
Published: (2026)

CvxCluster: Solving Large, Complex, Granular Resource Allocation Problems 100-1000x Faster
by: Nnorom Jr, Obi, et al.
Published: (2026)

Ensuring Data Freshness in Multi-Rate Task Chains Scheduling
by: Hoffmann, José Luis Conradi, et al.
Published: (2026)

Rethinking Inter-Process Communication with Memory Operation Offloading
by: Park, Misun, et al.
Published: (2026)

Why iCloud Fails: The Category Mistake of Cloud Synchronization
by: Borrill, Paul
Published: (2026)

Rethinking Thread Scheduling under Oversubscription: A User-Space Framework for Coordinating Multi-runtime and Multi-process Workloads
by: Roca, Aleix, et al.
Published: (2026)

Performance Isolation for Inference Processes in Edge GPU Systems
by: Martín, Juan José, et al.
Published: (2026)

Characterizing Metastable Faults and Failures
by: Farahbakhsh, Ali, et al.
Published: (2026)

ContiguousKV: Accelerating LLM Prefill with Granularity-Aligned KV Cache Management
by: Zou, Jing, et al.
Published: (2026)

Idiosyncrasies of Programmable Caching Engines
by: Peixoto, José, et al.
Published: (2026)

Nexus: Transparent I/O Offloading for High-Density Serverless Computing
by: Park, JooYoung, et al.
Published: (2026)

EdgeWeaver: Accelerating IoT Application Development Across Edge-Cloud Continuum
by: Lertpongrujikorn, Pawissanutt, et al.
Published: (2026)

LMetric: Simple is Better - Multiplication May Be All You Need for LLM Request Scheduling
by: Zhang, Dingyan, et al.
Published: (2026)

Mitigating context switching in densely packed Linux clusters with Latency-Aware Group Scheduling
by: Isstaif, Al Amjad Tawfiq, et al.
Published: (2025)

Unlocking True Elasticity for the Cloud-Native Era with Dandelion
by: Kuchler, Tom, et al.
Published: (2025)

THEMIS: Time, Heterogeneity, and Energy Minded Scheduling for Fair Multi-Tenant Use in FPGAs
by: Karabulut, Emre, et al.
Published: (2024)

Mewz: Lightweight Execution Environment for WebAssembly with High Isolation and Portability using Unikernels
by: Ueda, Soichiro, et al.
Published: (2024)

Fix: externalizing network I/O in serverless computing
by: Deng, Yuhan, et al.
Published: (2025)

RAGDoll: Efficient Offloading-based Online RAG System on a Single GPU
by: Yu, Weiping, et al.
Published: (2025)