Saved in:
Bibliographic Details
Main Authors: Duan, Bowen, Guo, Cong, Wei, Chiyue, Shan, Haoxuan, Fu, Yuzhe, Chen, Xinhua, Xu, Yifan, Zhang, Ziyue, Zhou, Changchun, Li, Hai, Chen, Yiran
Format: Recurso digital
Language:
Published: Zenodo 2026
Online Access:https://doi.org/10.5281/zenodo.19444241
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • <div> <div dir="ltr"> <p>This repository provides the official implementation and artifacts for the ISCA 2026 paper "EVA: Recasting LLM Decoding into GEMM via an Efficient Vector Quantization Architecture."</p> <p>This release corresponds to the artifact-evaluated version of the codebase. It includes all scripts, configuration files, and Jupyter notebooks required to reproduce the hardware performance and algorithm accuracy results reported in the paper.</p> </div> </div>