Saved in:
Bibliographic Details
Main Authors: Yan, Zhanglu, Mao, Jiayi, Liu, Qianhui, Li, Fanfan, Pan, Gang, Luo, Tao, Zhu, Bowen, Wong, Weng-Fai
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.18968
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914052286971904
author Yan, Zhanglu
Mao, Jiayi
Liu, Qianhui
Li, Fanfan
Pan, Gang
Luo, Tao
Zhu, Bowen
Wong, Weng-Fai
author_facet Yan, Zhanglu
Mao, Jiayi
Liu, Qianhui
Li, Fanfan
Pan, Gang
Luo, Tao
Zhu, Bowen
Wong, Weng-Fai
contents Spiking neural networks (SNNs) promise high energy efficiency, particularly with time-to-first-spike (TTFS) encoding, which maximizes sparsity by emitting at most one spike per neuron. However, such energy advantage is often unrealized because inference requires evaluating a temporal decay function and subsequent multiplication with the synaptic weights. This paper challenges this costly approach by repurposing a physical hardware `bug', namely, the natural signal decay in optoelectronic devices, as the core computation of TTFS. We fabricated a custom indium oxide optoelectronic synapse, showing how its natural physical decay directly implements the required temporal function. By treating the device's analog output as the fused product of the synaptic weight and temporal decay, optoelectronic synaptic TTFS (named Otters) eliminates these expensive digital operations. To use the Otters paradigm in complex architectures like the transformer, which are challenging to train directly due to the sparsity issue, we introduce a novel quantized neural network-to-SNN conversion algorithm. This complete hardware-software co-design enables our model to achieve state-of-the-art accuracy across seven GLUE benchmark datasets and demonstrates a 1.77$\times$ improvement in energy efficiency over previous leading SNNs, based on a comprehensive analysis of compute, data movement, and memory access costs using energy measurements from a commercial 22nm process. Our work thus establishes a new paradigm for energy-efficient SNNs, translating fundamental device physics directly into powerful computational primitives. All codes and data are open source.
format Preprint
id arxiv_https___arxiv_org_abs_2509_18968
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Otters: An Energy-Efficient SpikingTransformer via Optical Time-to-First-Spike Encoding
Yan, Zhanglu
Mao, Jiayi
Liu, Qianhui
Li, Fanfan
Pan, Gang
Luo, Tao
Zhu, Bowen
Wong, Weng-Fai
Machine Learning
Spiking neural networks (SNNs) promise high energy efficiency, particularly with time-to-first-spike (TTFS) encoding, which maximizes sparsity by emitting at most one spike per neuron. However, such energy advantage is often unrealized because inference requires evaluating a temporal decay function and subsequent multiplication with the synaptic weights. This paper challenges this costly approach by repurposing a physical hardware `bug', namely, the natural signal decay in optoelectronic devices, as the core computation of TTFS. We fabricated a custom indium oxide optoelectronic synapse, showing how its natural physical decay directly implements the required temporal function. By treating the device's analog output as the fused product of the synaptic weight and temporal decay, optoelectronic synaptic TTFS (named Otters) eliminates these expensive digital operations. To use the Otters paradigm in complex architectures like the transformer, which are challenging to train directly due to the sparsity issue, we introduce a novel quantized neural network-to-SNN conversion algorithm. This complete hardware-software co-design enables our model to achieve state-of-the-art accuracy across seven GLUE benchmark datasets and demonstrates a 1.77$\times$ improvement in energy efficiency over previous leading SNNs, based on a comprehensive analysis of compute, data movement, and memory access costs using energy measurements from a commercial 22nm process. Our work thus establishes a new paradigm for energy-efficient SNNs, translating fundamental device physics directly into powerful computational primitives. All codes and data are open source.
title Otters: An Energy-Efficient SpikingTransformer via Optical Time-to-First-Spike Encoding
topic Machine Learning
url https://arxiv.org/abs/2509.18968