Saved in:
| Main Author: | Gruber, Bernhard Manfred |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2302.08251 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Examem: Low-Overhead Memory Instrumentation for Intelligent Memory Systems
by: Poduval, Ashwin, et al.
Published: (2024)
by: Poduval, Ashwin, et al.
Published: (2024)
FB$^+$-tree: A Memory-Optimized B$^+$-tree with Latch-Free Update
by: Chen, Yuan, et al.
Published: (2025)
by: Chen, Yuan, et al.
Published: (2025)
Machine Learning-Guided Memory Optimization for DLRM Inference on Tiered Memory
by: Ren, Jie, et al.
Published: (2025)
by: Ren, Jie, et al.
Published: (2025)
Tuning Fast Memory Size based on Modeling of Page Migration for Tiered Memory
by: Chen, Shangye, et al.
Published: (2024)
by: Chen, Shangye, et al.
Published: (2024)
Heterogeneous Memory Pool Tuning
by: Vaverka, Filip, et al.
Published: (2025)
by: Vaverka, Filip, et al.
Published: (2025)
Multi-Strided Access Patterns to Boost Hardware Prefetching
by: Blom, Miguel O., et al.
Published: (2024)
by: Blom, Miguel O., et al.
Published: (2024)
Columbo: Low Level End-to-End System Traces through Modular Full-System Simulation
by: Görgen, Jakob, et al.
Published: (2024)
by: Görgen, Jakob, et al.
Published: (2024)
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
by: Shao, Zishan, et al.
Published: (2025)
by: Shao, Zishan, et al.
Published: (2025)
Modeling Utilization to Identify Shared-Memory Atomic Bottlenecks
by: Dong, Rongcui, et al.
Published: (2025)
by: Dong, Rongcui, et al.
Published: (2025)
Update!
by: Heller, Franziska
Published: (2020)
by: Heller, Franziska
Published: (2020)
EDAN: Towards Understanding Memory Parallelism and Latency Sensitivity in HPC
by: Shen, Siyuan, et al.
Published: (2025)
by: Shen, Siyuan, et al.
Published: (2025)
Optimizing CPU Cache Utilization in Cloud VMs with Accurate Cache Abstraction
by: Tofigh, Mani, et al.
Published: (2025)
by: Tofigh, Mani, et al.
Published: (2025)
CloverLeaf on Intel Multi-Core CPUs: A Case Study in Write-Allocate Evasion
by: Laukemann, Jan, et al.
Published: (2023)
by: Laukemann, Jan, et al.
Published: (2023)
Analysis and Evaluation of Using Microsecond-Latency Memory for In-Memory Indices and Caches in SSD-Based Key-Value Stores
by: Bando, Yosuke, et al.
Published: (2025)
by: Bando, Yosuke, et al.
Published: (2025)
WritePolicyBench: Benchmarking Memory Write Policies under Byte Budgets
by: Cham, Edgard El
Published: (2026)
by: Cham, Edgard El
Published: (2026)
MultiPath Memory Access: Breaking Host-GPU Bandwidth Bottlenecks in LLM Services
by: Tang, Lingfeng, et al.
Published: (2025)
by: Tang, Lingfeng, et al.
Published: (2025)
Repr Types: One Abstraction to Rule Them All
by: Palmkvist, Viktor, et al.
Published: (2024)
by: Palmkvist, Viktor, et al.
Published: (2024)
Meta-Metrics and Best Practices for System-Level Inference Performance Benchmarking
by: Salaria, Shweta, et al.
Published: (2025)
by: Salaria, Shweta, et al.
Published: (2025)
Sampling in Cloud Benchmarking: A Critical Review and Methodological Guidelines
by: Akbari, Saman, et al.
Published: (2025)
by: Akbari, Saman, et al.
Published: (2025)
Hiku: Pull-Based Scheduling for Serverless Computing
by: Akbari, Saman, et al.
Published: (2025)
by: Akbari, Saman, et al.
Published: (2025)
Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing
by: Akbari, Saman, et al.
Published: (2025)
by: Akbari, Saman, et al.
Published: (2025)
Count-Min Sketch with Conservative Updates: Worst-Case Analysis
by: Mazziane, Younes Ben, et al.
Published: (2024)
by: Mazziane, Younes Ben, et al.
Published: (2024)
Heterogeneous Memory Benchmarking Toolkit
by: Ghaemi, Golsana, et al.
Published: (2025)
by: Ghaemi, Golsana, et al.
Published: (2025)
Heterogeneous Data Access Model for Concurrency Control and Methods to Deal with High Data Contention
by: Thomasian, Alexander
Published: (2024)
by: Thomasian, Alexander
Published: (2024)
Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors
by: Sehgal, Rohit, et al.
Published: (2024)
by: Sehgal, Rohit, et al.
Published: (2024)
SparseX: Efficient Segment-Level KV Cache Sharing for Interleaved LLM Serving
by: Zhang, Quqing, et al.
Published: (2026)
by: Zhang, Quqing, et al.
Published: (2026)
Memshare: Memory Sharing for Multicore Computation in R with an Application to Feature Selection by Mutual Information using PDE
by: Thrun, Michael C., et al.
Published: (2025)
by: Thrun, Michael C., et al.
Published: (2025)
Dependence-Driven, Scalable Quantum Circuit Mapping with Affine Abstractions
by: Benbetka, Marouane, et al.
Published: (2025)
by: Benbetka, Marouane, et al.
Published: (2025)
Unikernels vs. Containers: A Runtime-Level Performance Comparison for Resource-Constrained Edge Workloads
by: Dinh-Tuan, Hai
Published: (2025)
by: Dinh-Tuan, Hai
Published: (2025)
HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing
by: Huang, Haochen, et al.
Published: (2025)
by: Huang, Haochen, et al.
Published: (2025)
Virtual-Memory Powersort
by: Moltmann, Finn, et al.
Published: (2026)
by: Moltmann, Finn, et al.
Published: (2026)
Lagrange Index based Scheduling for Minimizing Age of Updates from Heterogeneous Sources
by: Mukherjee, Aniket, et al.
Published: (2026)
by: Mukherjee, Aniket, et al.
Published: (2026)
A Controlled Study of Memory Hierarchy Transitions in Quantum Circuit Simulation on Apple M4 Pro Unified Memory Architecture
by: Pratipat, Gyan
Published: (2026)
by: Pratipat, Gyan
Published: (2026)
Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs
by: Owen, Herbert, et al.
Published: (2024)
by: Owen, Herbert, et al.
Published: (2024)
Memory Analysis on the Training Course of DeepSeek Models
by: Zhang, Ping, et al.
Published: (2025)
by: Zhang, Ping, et al.
Published: (2025)
Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics
by: Moura, Diego, et al.
Published: (2022)
by: Moura, Diego, et al.
Published: (2022)
ETM2: Empowering Traditional Memory Bandwidth Regulation using ETM
by: Zuepke, Alexander, et al.
Published: (2026)
by: Zuepke, Alexander, et al.
Published: (2026)
Delegation with Trust<T>: A Scalable, Type- and Memory-Safe Alternative to Locks
by: Ahmad, Noaman, et al.
Published: (2024)
by: Ahmad, Noaman, et al.
Published: (2024)
A$^3$PIM: An Automated, Analytic and Accurate Processing-in-Memory Offloader
by: Jiang, Qingcai, et al.
Published: (2024)
by: Jiang, Qingcai, et al.
Published: (2024)
Reducing Compute Waste in LLMs through Kernel-Level DVFS
by: Spaan, Jeffrey, et al.
Published: (2026)
by: Spaan, Jeffrey, et al.
Published: (2026)
Similar Items
-
Examem: Low-Overhead Memory Instrumentation for Intelligent Memory Systems
by: Poduval, Ashwin, et al.
Published: (2024) -
FB$^+$-tree: A Memory-Optimized B$^+$-tree with Latch-Free Update
by: Chen, Yuan, et al.
Published: (2025) -
Machine Learning-Guided Memory Optimization for DLRM Inference on Tiered Memory
by: Ren, Jie, et al.
Published: (2025) -
Tuning Fast Memory Size based on Modeling of Page Migration for Tiered Memory
by: Chen, Shangye, et al.
Published: (2024) -
Heterogeneous Memory Pool Tuning
by: Vaverka, Filip, et al.
Published: (2025)