Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Tao, Zelin, Deng, Hao, Liu, Mingqing, Zhang, Lijun, Zhao, Shengjie
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.17794
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916628935999488
author	Tao, Zelin Deng, Hao Liu, Mingqing Zhang, Lijun Zhao, Shengjie
author_facet	Tao, Zelin Deng, Hao Liu, Mingqing Zhang, Lijun Zhao, Shengjie
contents	Online continual learning (OCL), which enables AI systems to adaptively learn from non-stationary data streams, is commonly achieved using experience replay (ER)-based methods that retain knowledge by replaying stored past during training. However, these methods face challenges of prediction bias, stemming from deviations in parameter update directions during task transitions. This paper identifies parameter variation imbalance as a critical factor contributing to prediction bias in ER-based OCL. Specifically, using the proposed parameter variation evaluation method, we highlight two types of imbalance: correlation-induced imbalance, where certain parameters are disproportionately updated across tasks, and layer-wise imbalance, where output layer parameters update faster than those in preceding layers. To mitigate the above imbalances, we propose the Parameter Variation Balancing Framework (PVBF), which incorporates: 1) a novel method to compute parameter correlations with previous tasks based on parameter variations, 2) an encourage-and-consolidate (E&C) method utilizing parameter correlations to perform gradient adjustments across all parameters during training, 3) a dual-layer copy weights with reinit (D-CWR) strategy to slowly update output layer parameters for frequently occuring sample categories. Experiments on short and long task sequences demonstrate that PVBF significantly reduces prediction bias and improves OCL performance, achieving up to 47\% higher accuracy compared to existing ER-based methods.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_17794
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	PVBF: A Framework for Mitigating Parameter Variation Imbalance in Online Continual Learning Tao, Zelin Deng, Hao Liu, Mingqing Zhang, Lijun Zhao, Shengjie Machine Learning Online continual learning (OCL), which enables AI systems to adaptively learn from non-stationary data streams, is commonly achieved using experience replay (ER)-based methods that retain knowledge by replaying stored past during training. However, these methods face challenges of prediction bias, stemming from deviations in parameter update directions during task transitions. This paper identifies parameter variation imbalance as a critical factor contributing to prediction bias in ER-based OCL. Specifically, using the proposed parameter variation evaluation method, we highlight two types of imbalance: correlation-induced imbalance, where certain parameters are disproportionately updated across tasks, and layer-wise imbalance, where output layer parameters update faster than those in preceding layers. To mitigate the above imbalances, we propose the Parameter Variation Balancing Framework (PVBF), which incorporates: 1) a novel method to compute parameter correlations with previous tasks based on parameter variations, 2) an encourage-and-consolidate (E&C) method utilizing parameter correlations to perform gradient adjustments across all parameters during training, 3) a dual-layer copy weights with reinit (D-CWR) strategy to slowly update output layer parameters for frequently occuring sample categories. Experiments on short and long task sequences demonstrate that PVBF significantly reduces prediction bias and improves OCL performance, achieving up to 47\% higher accuracy compared to existing ER-based methods.
title	PVBF: A Framework for Mitigating Parameter Variation Imbalance in Online Continual Learning
topic	Machine Learning
url	https://arxiv.org/abs/2502.17794

Similar Items