Saved in:
Bibliographic Details
Main Authors: Tao, Zelin, Deng, Hao, Liu, Mingqing, Zhang, Lijun, Zhao, Shengjie
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.17794
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916628935999488
author Tao, Zelin
Deng, Hao
Liu, Mingqing
Zhang, Lijun
Zhao, Shengjie
author_facet Tao, Zelin
Deng, Hao
Liu, Mingqing
Zhang, Lijun
Zhao, Shengjie
contents Online continual learning (OCL), which enables AI systems to adaptively learn from non-stationary data streams, is commonly achieved using experience replay (ER)-based methods that retain knowledge by replaying stored past during training. However, these methods face challenges of prediction bias, stemming from deviations in parameter update directions during task transitions. This paper identifies parameter variation imbalance as a critical factor contributing to prediction bias in ER-based OCL. Specifically, using the proposed parameter variation evaluation method, we highlight two types of imbalance: correlation-induced imbalance, where certain parameters are disproportionately updated across tasks, and layer-wise imbalance, where output layer parameters update faster than those in preceding layers. To mitigate the above imbalances, we propose the Parameter Variation Balancing Framework (PVBF), which incorporates: 1) a novel method to compute parameter correlations with previous tasks based on parameter variations, 2) an encourage-and-consolidate (E&C) method utilizing parameter correlations to perform gradient adjustments across all parameters during training, 3) a dual-layer copy weights with reinit (D-CWR) strategy to slowly update output layer parameters for frequently occuring sample categories. Experiments on short and long task sequences demonstrate that PVBF significantly reduces prediction bias and improves OCL performance, achieving up to 47\% higher accuracy compared to existing ER-based methods.
format Preprint
id arxiv_https___arxiv_org_abs_2502_17794
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle PVBF: A Framework for Mitigating Parameter Variation Imbalance in Online Continual Learning
Tao, Zelin
Deng, Hao
Liu, Mingqing
Zhang, Lijun
Zhao, Shengjie
Machine Learning
Online continual learning (OCL), which enables AI systems to adaptively learn from non-stationary data streams, is commonly achieved using experience replay (ER)-based methods that retain knowledge by replaying stored past during training. However, these methods face challenges of prediction bias, stemming from deviations in parameter update directions during task transitions. This paper identifies parameter variation imbalance as a critical factor contributing to prediction bias in ER-based OCL. Specifically, using the proposed parameter variation evaluation method, we highlight two types of imbalance: correlation-induced imbalance, where certain parameters are disproportionately updated across tasks, and layer-wise imbalance, where output layer parameters update faster than those in preceding layers. To mitigate the above imbalances, we propose the Parameter Variation Balancing Framework (PVBF), which incorporates: 1) a novel method to compute parameter correlations with previous tasks based on parameter variations, 2) an encourage-and-consolidate (E&C) method utilizing parameter correlations to perform gradient adjustments across all parameters during training, 3) a dual-layer copy weights with reinit (D-CWR) strategy to slowly update output layer parameters for frequently occuring sample categories. Experiments on short and long task sequences demonstrate that PVBF significantly reduces prediction bias and improves OCL performance, achieving up to 47\% higher accuracy compared to existing ER-based methods.
title PVBF: A Framework for Mitigating Parameter Variation Imbalance in Online Continual Learning
topic Machine Learning
url https://arxiv.org/abs/2502.17794