Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zhang, Tong, Mailthody, Vikram Sharma, Sun, Fei, Ma, Linsen, Newburn, Chris J., Zhang, Teresa, Liu, Yang, Li, Jiangpeng, Zhong, Hao, Hwu, Wen-Mei
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2511.03944
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866915994568491008
author Zhang, Tong
Mailthody, Vikram Sharma
Sun, Fei
Ma, Linsen
Newburn, Chris J.
Zhang, Teresa
Liu, Yang
Li, Jiangpeng
Zhong, Hao
Hwu, Wen-Mei
author_facet Zhang, Tong
Mailthody, Vikram Sharma
Sun, Fei
Ma, Linsen
Newburn, Chris J.
Zhang, Teresa
Liu, Yang
Li, Jiangpeng
Zhong, Hao
Hwu, Wen-Mei
contents In 1987, Jim Gray and Gianfranco Putzolu introduced the five-minute rule, a simple, storage-memory-economics-based heuristic for deciding when data should live in DRAM rather than on storage. Subsequent revisits to the rule largely retained that economics-only view, leaving host costs, feasibility limits, and workload behavior out of scope. This paper revisits the rule from first principles, integrating host costs, DRAM bandwidth/capacity, and physics-grounded models of SSD performance and cost, and then embedding these elements in a constraint- and workload-aware framework that yields actionable provisioning guidance. We show that, for modern AI platforms, especially GPU-centric hosts paired with ultra-high-IOPS SSDs engineered for fine-grained random access, the DRAM$\leftrightarrow$flash caching threshold collapses from minutes to a few seconds. This shift reframes NAND flash memory as an \emph{active data tier} and exposes a broad research space across the hardware-software stack. We further introduce MQSim-Next, a calibrated SSD simulator that supports validation and sensitivity analysis and facilitates future architectural and system research. Finally, we present two concrete case studies that showcase the software system design space opened by such memory hierarchy paradigm shift. Overall, we turn a classical heuristic into an actionable, feasibility-aware analysis and provisioning framework and set the stage for further research on AI-era memory hierarchy.
format Preprint
id arxiv_https___arxiv_org_abs_2511_03944
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Five-Minute Rule 40 Years Later: A First-Principles Revisit for Modern Memory Hierarchy
Zhang, Tong
Mailthody, Vikram Sharma
Sun, Fei
Ma, Linsen
Newburn, Chris J.
Zhang, Teresa
Liu, Yang
Li, Jiangpeng
Zhong, Hao
Hwu, Wen-Mei
Hardware Architecture
In 1987, Jim Gray and Gianfranco Putzolu introduced the five-minute rule, a simple, storage-memory-economics-based heuristic for deciding when data should live in DRAM rather than on storage. Subsequent revisits to the rule largely retained that economics-only view, leaving host costs, feasibility limits, and workload behavior out of scope. This paper revisits the rule from first principles, integrating host costs, DRAM bandwidth/capacity, and physics-grounded models of SSD performance and cost, and then embedding these elements in a constraint- and workload-aware framework that yields actionable provisioning guidance. We show that, for modern AI platforms, especially GPU-centric hosts paired with ultra-high-IOPS SSDs engineered for fine-grained random access, the DRAM$\leftrightarrow$flash caching threshold collapses from minutes to a few seconds. This shift reframes NAND flash memory as an \emph{active data tier} and exposes a broad research space across the hardware-software stack. We further introduce MQSim-Next, a calibrated SSD simulator that supports validation and sensitivity analysis and facilitates future architectural and system research. Finally, we present two concrete case studies that showcase the software system design space opened by such memory hierarchy paradigm shift. Overall, we turn a classical heuristic into an actionable, feasibility-aware analysis and provisioning framework and set the stage for further research on AI-era memory hierarchy.
title Five-Minute Rule 40 Years Later: A First-Principles Revisit for Modern Memory Hierarchy
topic Hardware Architecture
url https://arxiv.org/abs/2511.03944