Saved in:
Bibliographic Details
Main Authors: Kurzynski, Marco, Sinclair, Matthew D.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.18113
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • In this work we have enhanced gem5's GPU model support to add Matrix Core Engines (MCEs). Specifically, on the AMD MI200 and MI300 GPUs that gem5 supports, these MCEs perform Matrix Fused Multiply Add (MFMA) instructions for a variety of precisions. By adding this support, our changes enable running state-of-the-art ML workloads in gem5, as well as examining how MCE optimizations impact the behavior of future systems.