Enregistré dans:
Détails bibliographiques
Auteurs principaux: Yang, Wulve, Zou, Hailong, Zhou, Rui, Zhang, Jionghao, Li, Qiang, Li, Gang, Zhan, Yi, Qiao, Shushan
Format: Preprint
Publié: 2026
Sujets:
Accès en ligne:https://arxiv.org/abs/2603.07962
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866915880131100672
author Yang, Wulve
Zou, Hailong
Zhou, Rui
Zhang, Jionghao
Li, Qiang
Li, Gang
Zhan, Yi
Qiao, Shushan
author_facet Yang, Wulve
Zou, Hailong
Zhou, Rui
Zhang, Jionghao
Li, Qiang
Li, Gang
Zhan, Yi
Qiao, Shushan
contents General matrix multiplication (GEMM) on spatial accelerators is highly sensitive to mapping choices in both execution efficiency and energy consumption. However, the mapping space exhibits combinatorial explosion, which makes it extremely challenging to obtain optimal mappings within an acceptable time budget. Existing approaches typically face challenges: They often lack global-optimality guarantees and become prohibitively slow as the mapping space grows. To address these limitations, we propose \textsc{GOMA}, a geometric-abstraction-based, globally optimal GEMM mapping framework via analytical modeling, which achieves efficient solving while guaranteeing optimality. \textsc{GOMA} introduces, from first principles, a geometric abstraction for GEMM mapping, yielding an exact analytical energy objective with $O(1)$ evaluation for any given mapping. The objective is highly accurate. \textsc{GOMA} then formulates mapping selection as an integer optimization problem under hardware and mapping constraints, using the analytical energy model as the objective to automate mapping search. \textsc{GOMA} can quickly compute a global-optimal mapping for any (GEMM workload, target hardware) pair, achieving this for the first time in mapping space exploration. Experiments confirm that across representative accelerators and large language model prefill workloads, \textsc{GOMA} improves the energy--delay product (EDP) by $2.24$--$4.24\times$ over SOTA mappers, while accelerating time-to-solution by $3.83$--$73.6\times$.
format Preprint
id arxiv_https___arxiv_org_abs_2603_07962
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle GOMA: Geometrically Optimal Mapping via Analytical Modeling for Spatial Accelerators
Yang, Wulve
Zou, Hailong
Zhou, Rui
Zhang, Jionghao
Li, Qiang
Li, Gang
Zhan, Yi
Qiao, Shushan
Hardware Architecture
General matrix multiplication (GEMM) on spatial accelerators is highly sensitive to mapping choices in both execution efficiency and energy consumption. However, the mapping space exhibits combinatorial explosion, which makes it extremely challenging to obtain optimal mappings within an acceptable time budget. Existing approaches typically face challenges: They often lack global-optimality guarantees and become prohibitively slow as the mapping space grows. To address these limitations, we propose \textsc{GOMA}, a geometric-abstraction-based, globally optimal GEMM mapping framework via analytical modeling, which achieves efficient solving while guaranteeing optimality. \textsc{GOMA} introduces, from first principles, a geometric abstraction for GEMM mapping, yielding an exact analytical energy objective with $O(1)$ evaluation for any given mapping. The objective is highly accurate. \textsc{GOMA} then formulates mapping selection as an integer optimization problem under hardware and mapping constraints, using the analytical energy model as the objective to automate mapping search. \textsc{GOMA} can quickly compute a global-optimal mapping for any (GEMM workload, target hardware) pair, achieving this for the first time in mapping space exploration. Experiments confirm that across representative accelerators and large language model prefill workloads, \textsc{GOMA} improves the energy--delay product (EDP) by $2.24$--$4.24\times$ over SOTA mappers, while accelerating time-to-solution by $3.83$--$73.6\times$.
title GOMA: Geometrically Optimal Mapping via Analytical Modeling for Spatial Accelerators
topic Hardware Architecture
url https://arxiv.org/abs/2603.07962