Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Rai, Saurabh, Prisha, Kumar, Jitendra
Format:	Preprint
Published:	2025
Subjects:	High Energy Physics - Phenomenology
Online Access:	https://arxiv.org/abs/2508.07431
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915438022098944
author	Rai, Saurabh Prisha Kumar, Jitendra
author_facet	Rai, Saurabh Prisha Kumar, Jitendra
contents	The increasing scale of deep learning models in high-energy physics (HEP) has posed challenges to their deployment on low-power, latency-sensitive platforms, such as FPGAs and ASICs used in trigger systems, as well as in offline data reconstruction and processing pipelines. In this work, we introduce BitParT, a 1-bit Transformer-based architecture designed specifically for the top-quark tagging method. Building upon recent advances in ultra-low-bit large language models (LLMs), we extended these ideas to the HEP domain by developing a binary-weight variant (BitParT) of the Particle Transformer (ParT) model. Our findings indicate a potential for substantial reduction in model size and computational complexity, while maintaining high tagging performance. We benchmark BitParT on the public Top Quark Tagging Reference Dataset and show that it achieves competitive performance relative to its full-precision counterpart. This work demonstrates the design of extreme quantized models for physics applications, paving the way for real-time inference in collider experiments with minimal and optimized resource usage.
format	Preprint
id	arxiv_https___arxiv_org_abs_2508_07431
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Investigating 1-Bit Quantization in Transformer-Based Top Tagging Rai, Saurabh Prisha Kumar, Jitendra High Energy Physics - Phenomenology The increasing scale of deep learning models in high-energy physics (HEP) has posed challenges to their deployment on low-power, latency-sensitive platforms, such as FPGAs and ASICs used in trigger systems, as well as in offline data reconstruction and processing pipelines. In this work, we introduce BitParT, a 1-bit Transformer-based architecture designed specifically for the top-quark tagging method. Building upon recent advances in ultra-low-bit large language models (LLMs), we extended these ideas to the HEP domain by developing a binary-weight variant (BitParT) of the Particle Transformer (ParT) model. Our findings indicate a potential for substantial reduction in model size and computational complexity, while maintaining high tagging performance. We benchmark BitParT on the public Top Quark Tagging Reference Dataset and show that it achieves competitive performance relative to its full-precision counterpart. This work demonstrates the design of extreme quantized models for physics applications, paving the way for real-time inference in collider experiments with minimal and optimized resource usage.
title	Investigating 1-Bit Quantization in Transformer-Based Top Tagging
topic	High Energy Physics - Phenomenology
url	https://arxiv.org/abs/2508.07431

Similar Items