Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kermani, Arshia, Zeraatkar, Ehsan, Irani, Habib
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence Performance
Online Access:	https://arxiv.org/abs/2502.16627
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912384750190592
author	Kermani, Arshia Zeraatkar, Ehsan Irani, Habib
author_facet	Kermani, Arshia Zeraatkar, Ehsan Irani, Habib
contents	The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment. Our study presents a systematic investigation of optimization techniques, focusing on structured pruning and quantization methods for transformer architectures. Through extensive experimentation on three distinct datasets (RefrigerationDevices, ElectricDevices, and PLAID), we quantitatively evaluate model performance and energy efficiency across different transformer configurations. Our experimental results demonstrate that static quantization reduces energy consumption by 29.14% while maintaining classification performance, and L1 pruning achieves a 63% improvement in inference speed with minimal accuracy degradation. Our findings provide valuable insights into the effectiveness of optimization strategies for transformer-based time series classification, establishing a foundation for efficient model deployment in resource-constrained environments.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_16627
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification Kermani, Arshia Zeraatkar, Ehsan Irani, Habib Machine Learning Artificial Intelligence Performance The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment. Our study presents a systematic investigation of optimization techniques, focusing on structured pruning and quantization methods for transformer architectures. Through extensive experimentation on three distinct datasets (RefrigerationDevices, ElectricDevices, and PLAID), we quantitatively evaluate model performance and energy efficiency across different transformer configurations. Our experimental results demonstrate that static quantization reduces energy consumption by 29.14% while maintaining classification performance, and L1 pruning achieves a 63% improvement in inference speed with minimal accuracy degradation. Our findings provide valuable insights into the effectiveness of optimization strategies for transformer-based time series classification, establishing a foundation for efficient model deployment in resource-constrained environments.
title	Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
topic	Machine Learning Artificial Intelligence Performance
url	https://arxiv.org/abs/2502.16627

Similar Items