Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kessler, Samuel, Cobb, Adam, Rudner, Tim G. J., Zohren, Stefan, Roberts, Stephen J.
Format:	Preprint
Published:	2023
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2301.01828
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909450612244480
author	Kessler, Samuel Cobb, Adam Rudner, Tim G. J. Zohren, Stefan Roberts, Stephen J.
author_facet	Kessler, Samuel Cobb, Adam Rudner, Tim G. J. Zohren, Stefan Roberts, Stephen J.
contents	Sequential Bayesian inference can be used for continual learning to prevent catastrophic forgetting of past tasks and provide an informative prior when learning new tasks. We revisit sequential Bayesian inference and test whether having access to the true posterior is guaranteed to prevent catastrophic forgetting in Bayesian neural networks. To do this we perform sequential Bayesian inference using Hamiltonian Monte Carlo. We propagate the posterior as a prior for new tasks by fitting a density estimator on Hamiltonian Monte Carlo samples. We find that this approach fails to prevent catastrophic forgetting demonstrating the difficulty in performing sequential Bayesian inference in neural networks. From there we study simple analytical examples of sequential Bayesian inference and CL and highlight the issue of model misspecification which can lead to sub-optimal continual learning performance despite exact inference. Furthermore, we discuss how task data imbalances can cause forgetting. From these limitations, we argue that we need probabilistic models of the continual learning generative process rather than relying on sequential Bayesian inference over Bayesian neural network weights. In this vein, we also propose a simple baseline called Prototypical Bayesian Continual Learning, which is competitive with state-of-the-art Bayesian continual learning methods on class incremental continual learning vision benchmarks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2301_01828
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	On Sequential Bayesian Inference for Continual Learning Kessler, Samuel Cobb, Adam Rudner, Tim G. J. Zohren, Stefan Roberts, Stephen J. Machine Learning Artificial Intelligence Sequential Bayesian inference can be used for continual learning to prevent catastrophic forgetting of past tasks and provide an informative prior when learning new tasks. We revisit sequential Bayesian inference and test whether having access to the true posterior is guaranteed to prevent catastrophic forgetting in Bayesian neural networks. To do this we perform sequential Bayesian inference using Hamiltonian Monte Carlo. We propagate the posterior as a prior for new tasks by fitting a density estimator on Hamiltonian Monte Carlo samples. We find that this approach fails to prevent catastrophic forgetting demonstrating the difficulty in performing sequential Bayesian inference in neural networks. From there we study simple analytical examples of sequential Bayesian inference and CL and highlight the issue of model misspecification which can lead to sub-optimal continual learning performance despite exact inference. Furthermore, we discuss how task data imbalances can cause forgetting. From these limitations, we argue that we need probabilistic models of the continual learning generative process rather than relying on sequential Bayesian inference over Bayesian neural network weights. In this vein, we also propose a simple baseline called Prototypical Bayesian Continual Learning, which is competitive with state-of-the-art Bayesian continual learning methods on class incremental continual learning vision benchmarks.
title	On Sequential Bayesian Inference for Continual Learning
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2301.01828

Similar Items