Saved in:
Bibliographic Details
Main Authors: Rai, Saurabh, Prisha, Kumar, Jitendra
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.07431
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915438022098944
author Rai, Saurabh
Prisha
Kumar, Jitendra
author_facet Rai, Saurabh
Prisha
Kumar, Jitendra
contents The increasing scale of deep learning models in high-energy physics (HEP) has posed challenges to their deployment on low-power, latency-sensitive platforms, such as FPGAs and ASICs used in trigger systems, as well as in offline data reconstruction and processing pipelines. In this work, we introduce BitParT, a 1-bit Transformer-based architecture designed specifically for the top-quark tagging method. Building upon recent advances in ultra-low-bit large language models (LLMs), we extended these ideas to the HEP domain by developing a binary-weight variant (BitParT) of the Particle Transformer (ParT) model. Our findings indicate a potential for substantial reduction in model size and computational complexity, while maintaining high tagging performance. We benchmark BitParT on the public Top Quark Tagging Reference Dataset and show that it achieves competitive performance relative to its full-precision counterpart. This work demonstrates the design of extreme quantized models for physics applications, paving the way for real-time inference in collider experiments with minimal and optimized resource usage.
format Preprint
id arxiv_https___arxiv_org_abs_2508_07431
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Investigating 1-Bit Quantization in Transformer-Based Top Tagging
Rai, Saurabh
Prisha
Kumar, Jitendra
High Energy Physics - Phenomenology
The increasing scale of deep learning models in high-energy physics (HEP) has posed challenges to their deployment on low-power, latency-sensitive platforms, such as FPGAs and ASICs used in trigger systems, as well as in offline data reconstruction and processing pipelines. In this work, we introduce BitParT, a 1-bit Transformer-based architecture designed specifically for the top-quark tagging method. Building upon recent advances in ultra-low-bit large language models (LLMs), we extended these ideas to the HEP domain by developing a binary-weight variant (BitParT) of the Particle Transformer (ParT) model. Our findings indicate a potential for substantial reduction in model size and computational complexity, while maintaining high tagging performance. We benchmark BitParT on the public Top Quark Tagging Reference Dataset and show that it achieves competitive performance relative to its full-precision counterpart. This work demonstrates the design of extreme quantized models for physics applications, paving the way for real-time inference in collider experiments with minimal and optimized resource usage.
title Investigating 1-Bit Quantization in Transformer-Based Top Tagging
topic High Energy Physics - Phenomenology
url https://arxiv.org/abs/2508.07431