Saved in:
Bibliographic Details
Main Authors: Johnson, Avery, Islam, Mohammad Majharul, Akram, Riad, Muzahid, Abdullah
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.19330
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917353983311872
author Johnson, Avery
Islam, Mohammad Majharul
Akram, Riad
Muzahid, Abdullah
author_facet Johnson, Avery
Islam, Mohammad Majharul
Akram, Riad
Muzahid, Abdullah
contents The exponential increase in complex IPs within modern SoCs, driven by Moore's Law, has created a pressing need for fast and accurate hardware-software power-performance analysis. Traditional performance simulators (such as cycle accurate simulators) are often too slow to simulate full benchmarks within a reasonable timeframe; require considerable effort for development, maintenance, and extensions; and are prone to errors, making pre-silicon performance projections and competitive analysis increasingly challenging. Prior attempts in addressing this challenge using machine learning fall short as they are either slow, inaccurate or unable to predict the performance of full benchmarks. To address these limitations, we present PAI, the first technique to accurately predict full benchmark performance without relying on detailed simulation or instruction-wise encoding. At the heart of PAI is a hierarchical Long Short Term Memory (LSTM)-based model that takes a trace of microarchitecture independent features from a program execution and predicts performance metrics. We present the detailed design, implementation and evaluation of PAI. Our initial experiments showed that PAI can achieve an average IPC prediction error of 9.35% for SPEC CPU 2017 benchmark suite while taking only 2 min 57 sec for the entire suite. This prediction error is comparable to prior state-of-the-art techniques while requiring 3 orders of magnitude less time.
format Preprint
id arxiv_https___arxiv_org_abs_2603_19330
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle PAI: Fast, Accurate, and Full Benchmark Performance Projection with AI
Johnson, Avery
Islam, Mohammad Majharul
Akram, Riad
Muzahid, Abdullah
Hardware Architecture
Artificial Intelligence
The exponential increase in complex IPs within modern SoCs, driven by Moore's Law, has created a pressing need for fast and accurate hardware-software power-performance analysis. Traditional performance simulators (such as cycle accurate simulators) are often too slow to simulate full benchmarks within a reasonable timeframe; require considerable effort for development, maintenance, and extensions; and are prone to errors, making pre-silicon performance projections and competitive analysis increasingly challenging. Prior attempts in addressing this challenge using machine learning fall short as they are either slow, inaccurate or unable to predict the performance of full benchmarks. To address these limitations, we present PAI, the first technique to accurately predict full benchmark performance without relying on detailed simulation or instruction-wise encoding. At the heart of PAI is a hierarchical Long Short Term Memory (LSTM)-based model that takes a trace of microarchitecture independent features from a program execution and predicts performance metrics. We present the detailed design, implementation and evaluation of PAI. Our initial experiments showed that PAI can achieve an average IPC prediction error of 9.35% for SPEC CPU 2017 benchmark suite while taking only 2 min 57 sec for the entire suite. This prediction error is comparable to prior state-of-the-art techniques while requiring 3 orders of magnitude less time.
title PAI: Fast, Accurate, and Full Benchmark Performance Projection with AI
topic Hardware Architecture
Artificial Intelligence
url https://arxiv.org/abs/2603.19330