Saved in:
Bibliographic Details
Main Authors: Gupta, Srishti, Balia, Riccardo, Angioni, Daniele, Brau, Fabio, Pintor, Maura, Demontis, Ambra, Sebastian, Alessandro, Carta, Salvatore Mario, Roli, Fabio, Biggio, Battista
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.19725
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915691457675264
author Gupta, Srishti
Balia, Riccardo
Angioni, Daniele
Brau, Fabio
Pintor, Maura
Demontis, Ambra
Sebastian, Alessandro
Carta, Salvatore Mario
Roli, Fabio
Biggio, Battista
author_facet Gupta, Srishti
Balia, Riccardo
Angioni, Daniele
Brau, Fabio
Pintor, Maura
Demontis, Ambra
Sebastian, Alessandro
Carta, Salvatore Mario
Roli, Fabio
Biggio, Battista
contents Recent years have witnessed significant progress in the development of machine learning models across a wide range of fields, fueled by increased computational resources, large-scale datasets, and the rise of deep learning architectures. From malware detection to enabling autonomous navigation, modern machine learning systems have demonstrated remarkable capabilities. However, as these models are deployed in ever-changing real-world scenarios, their ability to remain reliable and adaptive over time becomes increasingly important. For example, in the real world, new malware families are continuously developed, whereas autonomous driving cars are employed in many different cities and weather conditions. Models trained in fixed settings can not respond effectively to novel conditions encountered post-deployment. In fact, most machine learning models are still developed under the assumption that training and test data are independent and identically distributed (i.i.d.), i.e., sampled from the same underlying (unknown) distribution. While this assumption simplifies model development and evaluation, it does not hold in many real-world applications, where data changes over time and unexpected inputs frequently occur. Retraining models from scratch whenever new data appears is computationally expensive, time-consuming, and impractical in resource-constrained environments. These limitations underscore the need for Continual Learning (CL), which enables models to incrementally learn from evolving data streams without forgetting past knowledge, and Out-of-Distribution (OOD) detection, which allows systems to identify and respond to novel or anomalous inputs. Jointly addressing both challenges is critical to developing robust, efficient, and adaptive AI systems.
format Preprint
id arxiv_https___arxiv_org_abs_2512_19725
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Out-of-Distribution Detection for Continual Learning: Design Principles and Benchmarking
Gupta, Srishti
Balia, Riccardo
Angioni, Daniele
Brau, Fabio
Pintor, Maura
Demontis, Ambra
Sebastian, Alessandro
Carta, Salvatore Mario
Roli, Fabio
Biggio, Battista
Machine Learning
Recent years have witnessed significant progress in the development of machine learning models across a wide range of fields, fueled by increased computational resources, large-scale datasets, and the rise of deep learning architectures. From malware detection to enabling autonomous navigation, modern machine learning systems have demonstrated remarkable capabilities. However, as these models are deployed in ever-changing real-world scenarios, their ability to remain reliable and adaptive over time becomes increasingly important. For example, in the real world, new malware families are continuously developed, whereas autonomous driving cars are employed in many different cities and weather conditions. Models trained in fixed settings can not respond effectively to novel conditions encountered post-deployment. In fact, most machine learning models are still developed under the assumption that training and test data are independent and identically distributed (i.i.d.), i.e., sampled from the same underlying (unknown) distribution. While this assumption simplifies model development and evaluation, it does not hold in many real-world applications, where data changes over time and unexpected inputs frequently occur. Retraining models from scratch whenever new data appears is computationally expensive, time-consuming, and impractical in resource-constrained environments. These limitations underscore the need for Continual Learning (CL), which enables models to incrementally learn from evolving data streams without forgetting past knowledge, and Out-of-Distribution (OOD) detection, which allows systems to identify and respond to novel or anomalous inputs. Jointly addressing both challenges is critical to developing robust, efficient, and adaptive AI systems.
title Out-of-Distribution Detection for Continual Learning: Design Principles and Benchmarking
topic Machine Learning
url https://arxiv.org/abs/2512.19725