Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chen, Shengzhuang, Liao, Yikai, Sun, Xiaoxiao, Ma, Kede, Wei, Ying
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2503.04655
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916753786798080
author	Chen, Shengzhuang Liao, Yikai Sun, Xiaoxiao Ma, Kede Wei, Ying
author_facet	Chen, Shengzhuang Liao, Yikai Sun, Xiaoxiao Ma, Kede Wei, Ying
contents	The advent of the foundation model era has sparked significant research interest in leveraging pre-trained representations for continual learning (CL), yielding a series of top-performing CL methods on standard evaluation benchmarks. Nonetheless, there are growing concerns regarding potential data contamination during the pre-training stage. Furthermore, standard evaluation benchmarks, which are typically static, fail to capture the complexities of real-world CL scenarios, resulting in saturated performance. To address these issues, we describe CL on dynamic benchmarks (CLDyB), a general computational framework based on Markov decision processes for evaluating CL methods reliably. CLDyB dynamically identifies inherently difficult and algorithm-dependent tasks for the given CL methods, and determines challenging task orders using Monte Carlo tree search. Leveraging CLDyB, we first conduct a joint evaluation of multiple state-of-the-art CL methods, leading to a set of commonly challenging and generalizable task sequences where existing CL methods tend to perform poorly. We then conduct separate evaluations of individual CL methods using CLDyB, discovering their respective strengths and weaknesses. The source code and generated task sequences are publicly accessible at https://github.com/szc12153/CLDyB.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_04655
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models Chen, Shengzhuang Liao, Yikai Sun, Xiaoxiao Ma, Kede Wei, Ying Machine Learning The advent of the foundation model era has sparked significant research interest in leveraging pre-trained representations for continual learning (CL), yielding a series of top-performing CL methods on standard evaluation benchmarks. Nonetheless, there are growing concerns regarding potential data contamination during the pre-training stage. Furthermore, standard evaluation benchmarks, which are typically static, fail to capture the complexities of real-world CL scenarios, resulting in saturated performance. To address these issues, we describe CL on dynamic benchmarks (CLDyB), a general computational framework based on Markov decision processes for evaluating CL methods reliably. CLDyB dynamically identifies inherently difficult and algorithm-dependent tasks for the given CL methods, and determines challenging task orders using Monte Carlo tree search. Leveraging CLDyB, we first conduct a joint evaluation of multiple state-of-the-art CL methods, leading to a set of commonly challenging and generalizable task sequences where existing CL methods tend to perform poorly. We then conduct separate evaluations of individual CL methods using CLDyB, discovering their respective strengths and weaknesses. The source code and generated task sequences are publicly accessible at https://github.com/szc12153/CLDyB.
title	CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models
topic	Machine Learning
url	https://arxiv.org/abs/2503.04655

Similar Items