Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Matykiewicz, Wiktor, Wzorek, Piotr, Jeziorek, Kamil, Muñoz, Tomás, Rios-Navarro, Antonio, Jiménez-Fernández, Angel, Kryjak, Tomasz
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2605.09570
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918493662740480
author	Matykiewicz, Wiktor Wzorek, Piotr Jeziorek, Kamil Muñoz, Tomás Rios-Navarro, Antonio Jiménez-Fernández, Angel Kryjak, Tomasz
author_facet	Matykiewicz, Wiktor Wzorek, Piotr Jeziorek, Kamil Muñoz, Tomás Rios-Navarro, Antonio Jiménez-Fernández, Angel Kryjak, Tomasz
contents	With the rapid growth of mobile robotics and embedded intelligence, there is an increasing demand for efficient on-device data processing on edge platforms. A promising research direction is the use of neuromorphic sensors inspired by human sensory systems, which generate sparse, event-based data encoding changes in the environment. In this work, we present the first end-to-end FPGA implementation of a keyword spotting system that integrates a Neuromorphic Auditory Sensor (NAS) and a graph neural network (GNN) on a single FPGA device, enabling real-time processing of raw audio data. The proposed architecture eliminates conventional signal preprocessing and operates directly on event-based audio streams. Leveraging a compute-near-memory network architecture, the system achieves efficient inference with low latency and low power consumption. Experimental results demonstrate an accuracy of 87.43% after quantization on the Google Speech Commands v2 dataset processed through the neuromorphic sensor, with end-to-end latency below 35 us and average power consumption of 1.12 W. The processed datasets, software models, and hardware modules are available at https://github.com/vision-agh/NAS-GNN-KWS.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_09570
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	End-to-End Keyword Spotting on FPGA Using Graph Neural Networks with a Neuromorphic Auditory Sensor Matykiewicz, Wiktor Wzorek, Piotr Jeziorek, Kamil Muñoz, Tomás Rios-Navarro, Antonio Jiménez-Fernández, Angel Kryjak, Tomasz Machine Learning With the rapid growth of mobile robotics and embedded intelligence, there is an increasing demand for efficient on-device data processing on edge platforms. A promising research direction is the use of neuromorphic sensors inspired by human sensory systems, which generate sparse, event-based data encoding changes in the environment. In this work, we present the first end-to-end FPGA implementation of a keyword spotting system that integrates a Neuromorphic Auditory Sensor (NAS) and a graph neural network (GNN) on a single FPGA device, enabling real-time processing of raw audio data. The proposed architecture eliminates conventional signal preprocessing and operates directly on event-based audio streams. Leveraging a compute-near-memory network architecture, the system achieves efficient inference with low latency and low power consumption. Experimental results demonstrate an accuracy of 87.43% after quantization on the Google Speech Commands v2 dataset processed through the neuromorphic sensor, with end-to-end latency below 35 us and average power consumption of 1.12 W. The processed datasets, software models, and hardware modules are available at https://github.com/vision-agh/NAS-GNN-KWS.
title	End-to-End Keyword Spotting on FPGA Using Graph Neural Networks with a Neuromorphic Auditory Sensor
topic	Machine Learning
url	https://arxiv.org/abs/2605.09570

Similar Items