Saved in:
Bibliographic Details
Main Authors: Chen, Yizhi, Hemani, Ahmed
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.09451
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • We propose Quamba-SE, a soft-edge quantizer for State Space Model (SSM) activation quantization. Unlike existing methods, using standard INT8 operation, Quamba-SE employs three adaptive scales: high-precision for small values, standard scale for normal values, and low-precision for outliers. This preserves outlier information instead of hard clipping, while maintaining precision for other values. We evaluate on Mamba- 130M across 6 zero-shot benchmarks. Results show that Quamba- SE consistently outperforms Quamba, achieving up to +2.68% on individual benchmarks and up to +0.83% improvement in the average accuracy of 6 datasets.