Saved in:
Bibliographic Details
Main Authors: Aksoy, Alperen, Bekman, Ilja, Dimitrov, Vesselin, Dorosti, Qader, Eguzo, Chimezie, Fleitmann, Sarah, Grewing, Christian, Hader, Fabian, Zambanini, Andre, van Waasen, Stefan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.05479
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This paper examines the use of Quantized Neural Networks (QNNs) for two resource-constrained scientific applications: automated calibration of semi-conductor quantum bits (qubits) and scientific particle detectors. We evaluate the trade-offs between Post-Training Quantization (PTQ), Quantization-Aware Training (QAT), and ultra-low-bit Binary Neural Networks (BNNs) with respect to latency and resource usage. Our results demonstrate that PTQ achieves a four-fold reduction in memory usage for U-shaped CNN (U-Net) architectures while maintaining or slightly enhancing segmentation accuracy (e.g. from 89% to 90% for a small U-Net with 447 parameters). For the training of non-differentiable custom BNNs , we propose a novel, hardware-constrained learning approach using Genetic Algorithms (GAs). We showcase a LUT-based BNN architecture suitable for direct conversion to VHDL via the HCL4BNN framework. This method achieves nanosecond-scale inference latencies (10-15 ns) without requiring specialized DSP or BRAM resources.