Saved in:
Bibliographic Details
Main Authors: Kermani, Arshia, Zeraatkar, Ehsan, Irani, Habib
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.16627
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912384750190592
author Kermani, Arshia
Zeraatkar, Ehsan
Irani, Habib
author_facet Kermani, Arshia
Zeraatkar, Ehsan
Irani, Habib
contents The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment. Our study presents a systematic investigation of optimization techniques, focusing on structured pruning and quantization methods for transformer architectures. Through extensive experimentation on three distinct datasets (RefrigerationDevices, ElectricDevices, and PLAID), we quantitatively evaluate model performance and energy efficiency across different transformer configurations. Our experimental results demonstrate that static quantization reduces energy consumption by 29.14% while maintaining classification performance, and L1 pruning achieves a 63% improvement in inference speed with minimal accuracy degradation. Our findings provide valuable insights into the effectiveness of optimization strategies for transformer-based time series classification, establishing a foundation for efficient model deployment in resource-constrained environments.
format Preprint
id arxiv_https___arxiv_org_abs_2502_16627
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
Kermani, Arshia
Zeraatkar, Ehsan
Irani, Habib
Machine Learning
Artificial Intelligence
Performance
The increasing computational demands of transformer models in time series classification necessitate effective optimization strategies for energy-efficient deployment. Our study presents a systematic investigation of optimization techniques, focusing on structured pruning and quantization methods for transformer architectures. Through extensive experimentation on three distinct datasets (RefrigerationDevices, ElectricDevices, and PLAID), we quantitatively evaluate model performance and energy efficiency across different transformer configurations. Our experimental results demonstrate that static quantization reduces energy consumption by 29.14% while maintaining classification performance, and L1 pruning achieves a 63% improvement in inference speed with minimal accuracy degradation. Our findings provide valuable insights into the effectiveness of optimization strategies for transformer-based time series classification, establishing a foundation for efficient model deployment in resource-constrained environments.
title Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification
topic Machine Learning
Artificial Intelligence
Performance
url https://arxiv.org/abs/2502.16627