Saved in:
Bibliographic Details
Main Authors: Jin, Ruibing, Xu, Qing, Wu, Min, Xu, Yuecong, Li, Dan, Li, Xiaoli, Chen, Zhenghua
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.08765
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917692412264448
author Jin, Ruibing
Xu, Qing
Wu, Min
Xu, Yuecong
Li, Dan
Li, Xiaoli
Chen, Zhenghua
author_facet Jin, Ruibing
Xu, Qing
Wu, Min
Xu, Yuecong
Li, Dan
Li, Xiaoli
Chen, Zhenghua
contents Limited by the scale and diversity of time series data, the neural networks trained on time series data often overfit and show unsatisfacotry performances. In comparison, large language models (LLMs) recently exhibit impressive generalization in diverse fields. Although massive LLM based approaches are proposed for time series tasks, these methods require to load the whole LLM in both training and reference. This high computational demands limit practical applications in resource-constrained settings, like edge-computing and IoT devices. To address this issue, we propose Knowledge Pruning (KP), a novel paradigm for time series learning in this paper. For a specific downstream task, we argue that the world knowledge learned by LLMs is much redundant and only the related knowledge termed as "pertinent knowledge" is useful. Unlike other methods, our KP targets to prune the redundant knowledge and only distill the pertinent knowledge into the target model. This reduces model size and computational costs significantly. Additionally, different from existing LLM based approaches, our KP does not require to load the LLM in the process of training and testing, further easing computational burdens. With our proposed KP, a lightweight network can effectively learn the pertinent knowledge, achieving satisfactory performances with a low computation cost. To verify the effectiveness of our KP, two fundamental tasks on edge-computing devices are investigated in our experiments, where eight diverse environments or benchmarks with different networks are used to verify the generalization of our KP. Through experiments, our KP demonstrates effective learning of pertinent knowledge, achieving notable performance improvements in regression (19.7% on average) and classification (up to 13.7%) tasks, showcasing state-of-the-art results.
format Preprint
id arxiv_https___arxiv_org_abs_2406_08765
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle LLM-based Knowledge Pruning for Time Series Data Analytics on Edge-computing Devices
Jin, Ruibing
Xu, Qing
Wu, Min
Xu, Yuecong
Li, Dan
Li, Xiaoli
Chen, Zhenghua
Machine Learning
Limited by the scale and diversity of time series data, the neural networks trained on time series data often overfit and show unsatisfacotry performances. In comparison, large language models (LLMs) recently exhibit impressive generalization in diverse fields. Although massive LLM based approaches are proposed for time series tasks, these methods require to load the whole LLM in both training and reference. This high computational demands limit practical applications in resource-constrained settings, like edge-computing and IoT devices. To address this issue, we propose Knowledge Pruning (KP), a novel paradigm for time series learning in this paper. For a specific downstream task, we argue that the world knowledge learned by LLMs is much redundant and only the related knowledge termed as "pertinent knowledge" is useful. Unlike other methods, our KP targets to prune the redundant knowledge and only distill the pertinent knowledge into the target model. This reduces model size and computational costs significantly. Additionally, different from existing LLM based approaches, our KP does not require to load the LLM in the process of training and testing, further easing computational burdens. With our proposed KP, a lightweight network can effectively learn the pertinent knowledge, achieving satisfactory performances with a low computation cost. To verify the effectiveness of our KP, two fundamental tasks on edge-computing devices are investigated in our experiments, where eight diverse environments or benchmarks with different networks are used to verify the generalization of our KP. Through experiments, our KP demonstrates effective learning of pertinent knowledge, achieving notable performance improvements in regression (19.7% on average) and classification (up to 13.7%) tasks, showcasing state-of-the-art results.
title LLM-based Knowledge Pruning for Time Series Data Analytics on Edge-computing Devices
topic Machine Learning
url https://arxiv.org/abs/2406.08765