:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Gruber, Bernhard Manfred
Format:	Preprint
Published:	2023
Subjects:	Performance
Online Access:	https://arxiv.org/abs/2302.08251
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Examem: Low-Overhead Memory Instrumentation for Intelligent Memory Systems
by: Poduval, Ashwin, et al.
Published: (2024)

FB$^+$-tree: A Memory-Optimized B$^+$-tree with Latch-Free Update
by: Chen, Yuan, et al.
Published: (2025)

Machine Learning-Guided Memory Optimization for DLRM Inference on Tiered Memory
by: Ren, Jie, et al.
Published: (2025)

Tuning Fast Memory Size based on Modeling of Page Migration for Tiered Memory
by: Chen, Shangye, et al.
Published: (2024)

Heterogeneous Memory Pool Tuning
by: Vaverka, Filip, et al.
Published: (2025)

Multi-Strided Access Patterns to Boost Hardware Prefetching
by: Blom, Miguel O., et al.
Published: (2024)

Columbo: Low Level End-to-End System Traces through Modular Full-System Simulation
by: Görgen, Jakob, et al.
Published: (2024)

FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
by: Shao, Zishan, et al.
Published: (2025)

Modeling Utilization to Identify Shared-Memory Atomic Bottlenecks
by: Dong, Rongcui, et al.
Published: (2025)

Update!
by: Heller, Franziska
Published: (2020)

EDAN: Towards Understanding Memory Parallelism and Latency Sensitivity in HPC
by: Shen, Siyuan, et al.
Published: (2025)

Optimizing CPU Cache Utilization in Cloud VMs with Accurate Cache Abstraction
by: Tofigh, Mani, et al.
Published: (2025)

CloverLeaf on Intel Multi-Core CPUs: A Case Study in Write-Allocate Evasion
by: Laukemann, Jan, et al.
Published: (2023)

Analysis and Evaluation of Using Microsecond-Latency Memory for In-Memory Indices and Caches in SSD-Based Key-Value Stores
by: Bando, Yosuke, et al.
Published: (2025)

WritePolicyBench: Benchmarking Memory Write Policies under Byte Budgets
by: Cham, Edgard El
Published: (2026)

MultiPath Memory Access: Breaking Host-GPU Bandwidth Bottlenecks in LLM Services
by: Tang, Lingfeng, et al.
Published: (2025)

Repr Types: One Abstraction to Rule Them All
by: Palmkvist, Viktor, et al.
Published: (2024)

Meta-Metrics and Best Practices for System-Level Inference Performance Benchmarking
by: Salaria, Shweta, et al.
Published: (2025)

Sampling in Cloud Benchmarking: A Critical Review and Methodological Guidelines
by: Akbari, Saman, et al.
Published: (2025)

Hiku: Pull-Based Scheduling for Serverless Computing
by: Akbari, Saman, et al.
Published: (2025)

Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing
by: Akbari, Saman, et al.
Published: (2025)

Count-Min Sketch with Conservative Updates: Worst-Case Analysis
by: Mazziane, Younes Ben, et al.
Published: (2024)

Heterogeneous Memory Benchmarking Toolkit
by: Ghaemi, Golsana, et al.
Published: (2025)

Heterogeneous Data Access Model for Concurrency Control and Methods to Deal with High Data Contention
by: Thomasian, Alexander
Published: (2024)

Optimizing System Memory Bandwidth with Micron CXL Memory Expansion Modules on Intel Xeon 6 Processors
by: Sehgal, Rohit, et al.
Published: (2024)

SparseX: Efficient Segment-Level KV Cache Sharing for Interleaved LLM Serving
by: Zhang, Quqing, et al.
Published: (2026)

Memshare: Memory Sharing for Multicore Computation in R with an Application to Feature Selection by Mutual Information using PDE
by: Thrun, Michael C., et al.
Published: (2025)

Dependence-Driven, Scalable Quantum Circuit Mapping with Affine Abstractions
by: Benbetka, Marouane, et al.
Published: (2025)

Unikernels vs. Containers: A Runtime-Level Performance Comparison for Resource-Constrained Edge Workloads
by: Dinh-Tuan, Hai
Published: (2025)

HD-MoE: Hybrid and Dynamic Parallelism for Mixture-of-Expert LLMs with 3D Near-Memory Processing
by: Huang, Haochen, et al.
Published: (2025)

Virtual-Memory Powersort
by: Moltmann, Finn, et al.
Published: (2026)

Lagrange Index based Scheduling for Minimizing Age of Updates from Heterogeneous Sources
by: Mukherjee, Aniket, et al.
Published: (2026)

A Controlled Study of Memory Hierarchy Transitions in Quantum Circuit Simulation on Apple M4 Pro Unified Memory Architecture
by: Pratipat, Gyan
Published: (2026)

Alya towards Exascale: Optimal OpenACC Performance of the Navier-Stokes Finite Element Assembly on GPUs
by: Owen, Herbert, et al.
Published: (2024)

Memory Analysis on the Training Course of DeepSeek Models
by: Zhang, Ping, et al.
Published: (2025)

Performance Characterization of AutoNUMA Memory Tiering on Graph Analytics
by: Moura, Diego, et al.
Published: (2022)

ETM2: Empowering Traditional Memory Bandwidth Regulation using ETM
by: Zuepke, Alexander, et al.
Published: (2026)

Delegation with Trust<T>: A Scalable, Type- and Memory-Safe Alternative to Locks
by: Ahmad, Noaman, et al.
Published: (2024)

A$^3$PIM: An Automated, Analytic and Accurate Processing-in-Memory Offloader
by: Jiang, Qingcai, et al.
Published: (2024)

Reducing Compute Waste in LLMs through Kernel-Level DVFS
by: Spaan, Jeffrey, et al.
Published: (2026)