Saved in:
| Main Authors: | Siavashi, Mohammad, Sanaee, Alireza, Sharifi, Mohsen, Antichi, Gianni |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.10923 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
numaPTE: Managing Page-Tables and TLBs on NUMA Systems
by: Gao, Bin, et al.
Published: (2024)
by: Gao, Bin, et al.
Published: (2024)
Fast Userspace Networking for the Rest of Us
by: Sanaee, Alireza, et al.
Published: (2025)
by: Sanaee, Alireza, et al.
Published: (2025)
Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics
by: Moura, Diego, et al.
Published: (2022)
by: Moura, Diego, et al.
Published: (2022)
CARTOS: A Charging-Aware Real-Time Operating System for Intermittent Batteryless Devices
by: Karimi, Mohsen, et al.
Published: (2023)
by: Karimi, Mohsen, et al.
Published: (2023)
Taking the Leap: Efficient and Reliable Fine-Grained NUMA Migration in User-space
by: Schuhknecht, Felix, et al.
Published: (2026)
by: Schuhknecht, Felix, et al.
Published: (2026)
Blink: CPU-Free LLM Inference by Delegating the Serving Stack to GPU and SmartNIC
by: Siavashi, Mohammad, et al.
Published: (2026)
by: Siavashi, Mohammad, et al.
Published: (2026)
Cache is King: Smart Page Eviction with eBPF
by: Zussman, Tal, et al.
Published: (2025)
by: Zussman, Tal, et al.
Published: (2025)
Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration
by: Xiang, Lingfeng, et al.
Published: (2024)
by: Xiang, Lingfeng, et al.
Published: (2024)
TierBPF: Page Migration Admission Control for Tiered Memory via eBPF
by: Wang, Xi, et al.
Published: (2026)
by: Wang, Xi, et al.
Published: (2026)
Exploiting Page Faults for Covert Communication
by: Swaminathan, Sathvik
Published: (2025)
by: Swaminathan, Sathvik
Published: (2025)
ASIC-based Compression Accelerators for Storage Systems: Design, Placement, and Profiling Insights
by: Lu, Tao, et al.
Published: (2025)
by: Lu, Tao, et al.
Published: (2025)
PhoenixOS: Concurrent OS-level GPU Checkpoint and Restore with Validated Speculation
by: Wei, Xingda, et al.
Published: (2024)
by: Wei, Xingda, et al.
Published: (2024)
Ariadne: A Hotness-Aware and Size-Adaptive Compressed Swap Technique for Fast Application Relaunch and Reduced CPU Usage on Mobile Devices
by: Liang, Yu, et al.
Published: (2025)
by: Liang, Yu, et al.
Published: (2025)
Energy-Aware CPU Orchestration in O-RAN: A dApp-Driven Lightweight Approach
by: Crespo, Francisco, et al.
Published: (2025)
by: Crespo, Francisco, et al.
Published: (2025)
SARA: A Stall-Aware Memory Allocation Strategy for Mixed-Criticality Systems
by: Lee, Meng-Chia, et al.
Published: (2025)
by: Lee, Meng-Chia, et al.
Published: (2025)
Valet: Efficient Data Placement on Modern SSDs
by: Purandare, Devashish R., et al.
Published: (2025)
by: Purandare, Devashish R., et al.
Published: (2025)
Principled Performance Tunability in Operating System Kernels
by: Chen, Zhongjie, et al.
Published: (2025)
by: Chen, Zhongjie, et al.
Published: (2025)
ThunderAgent: A Simple, Fast and Program-Aware Agentic Inference System
by: Kang, Hao, et al.
Published: (2026)
by: Kang, Hao, et al.
Published: (2026)
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention
by: Prabhu, Ramya, et al.
Published: (2024)
by: Prabhu, Ramya, et al.
Published: (2024)
DPC: A Distributed Page Cache over CXL
by: Bergman, Shai, et al.
Published: (2026)
by: Bergman, Shai, et al.
Published: (2026)
Dirigent: Lightweight Serverless Orchestration
by: Cvetković, Lazar, et al.
Published: (2024)
by: Cvetković, Lazar, et al.
Published: (2024)
Edge-Based QoS-Aware Adaptive Task Placement: A Closed-Loop Control in Multi-Robot Systems
by: Tran, Thien, et al.
Published: (2026)
by: Tran, Thien, et al.
Published: (2026)
Integrating Artificial Intelligence into Operating Systems: A Survey on Techniques, Applications, and Future Directions
by: Zhang, Yifan, et al.
Published: (2024)
by: Zhang, Yifan, et al.
Published: (2024)
LearnedFTL: A Learning-Based Page-Level FTL for Reducing Double Reads in Flash-Based SSDs
by: Wang, Shengzhe, et al.
Published: (2023)
by: Wang, Shengzhe, et al.
Published: (2023)
Rethinking Thread Scheduling under Oversubscription: A User-Space Framework for Coordinating Multi-runtime and Multi-process Workloads
by: Roca, Aleix, et al.
Published: (2026)
by: Roca, Aleix, et al.
Published: (2026)
Defending Event-Triggered Systems against Out-of-Envelope Environments
by: Völp, Marcus, et al.
Published: (2025)
by: Völp, Marcus, et al.
Published: (2025)
Funky: Cloud-Native FPGA Virtualization and Orchestration
by: Koshiba, Atsushi, et al.
Published: (2025)
by: Koshiba, Atsushi, et al.
Published: (2025)
The Missing Memory Hierarchy: Demand Paging for LLM Context Windows
by: Mason, Tony
Published: (2026)
by: Mason, Tony
Published: (2026)
Beyond Control: Exploring Novel File System Objects for Data-Only Attacks on Linux Systems
by: Zhou, Jinmeng, et al.
Published: (2024)
by: Zhou, Jinmeng, et al.
Published: (2024)
RUISA Operational Ecosystem Architecture
by: AL Mohtar, Mouayad
Published: (2026)
by: AL Mohtar, Mouayad
Published: (2026)
Boosting File Systems Elegantly: A Transparent NVM Write-ahead Log for Disk File Systems
by: Wang, Guoyu, et al.
Published: (2024)
by: Wang, Guoyu, et al.
Published: (2024)
HACache: Leveraging Read Performance with Cache in a Heterogeneous Array
by: Liu, Jialin, et al.
Published: (2026)
by: Liu, Jialin, et al.
Published: (2026)
RTP-LLM: High-Performance Alibaba LLM Inference Engine
by: Tan, Boyu, et al.
Published: (2026)
by: Tan, Boyu, et al.
Published: (2026)
Dissecting CXL Memory Performance at Scale: Analysis, Modeling, and Optimization
by: Liu, Jinshu, et al.
Published: (2024)
by: Liu, Jinshu, et al.
Published: (2024)
Iridescent: A Framework Enabling Online System Implementation Specialization
by: Anand, Vaastav, et al.
Published: (2025)
by: Anand, Vaastav, et al.
Published: (2025)
From Good to Great: Improving Memory Tiering Performance Through Parameter Tuning
by: Kanellis, Konstantinos, et al.
Published: (2025)
by: Kanellis, Konstantinos, et al.
Published: (2025)
FRAP: A Flexible Resource Accessing Protocol for Multiprocessor Real-Time Systems
by: Zhao, Shuai, et al.
Published: (2024)
by: Zhao, Shuai, et al.
Published: (2024)
Assessing FIFO and Round Robin Scheduling:Effects on Data Pipeline Performance and Energy Usage
by: Choudhury, Malobika Roy, et al.
Published: (2024)
by: Choudhury, Malobika Roy, et al.
Published: (2024)
Analyzing Configuration Dependencies of File Systems
by: Mahmud, Tabassum, et al.
Published: (2025)
by: Mahmud, Tabassum, et al.
Published: (2025)
The First Principle of Big Memory Systems
by: Hua, Yu
Published: (2023)
by: Hua, Yu
Published: (2023)
Similar Items
-
numaPTE: Managing Page-Tables and TLBs on NUMA Systems
by: Gao, Bin, et al.
Published: (2024) -
Fast Userspace Networking for the Rest of Us
by: Sanaee, Alireza, et al.
Published: (2025) -
Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics
by: Moura, Diego, et al.
Published: (2022) -
CARTOS: A Charging-Aware Real-Time Operating System for Intermittent Batteryless Devices
by: Karimi, Mohsen, et al.
Published: (2023) -
Taking the Leap: Efficient and Reliable Fine-Grained NUMA Migration in User-space
by: Schuhknecht, Felix, et al.
Published: (2026)