Saved in:
Bibliographic Details
Main Author: Lazebnik, Teddy
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.04534
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911130707820544
author Lazebnik, Teddy
author_facet Lazebnik, Teddy
contents Detecting and understanding out-of-distribution (OOD) samples is crucial in machine learning (ML) to ensure reliable model performance. Current OOD studies primarily focus on extrapolatory (outside) OOD, neglecting potential cases of interpolatory (inside) OOD. In this study, we introduce a novel perspective on OOD by suggesting it can be divided into inside and outside cases. We examine the inside-outside OOD profiles of datasets and their impact on ML model performance, using normalized Root Mean Squared Error (RMSE) and F1 score as the performance metrics on syntetically-generated datasets with both inside and outside OOD. Our analysis demonstrates that different inside-outside OOD profiles lead to unique effects on ML model performance, with outside OOD generally causing greater performance degradation, on average. These findings highlight the importance of distinguishing between inside and outside OOD for developing effective counter-OOD methods.
format Preprint
id arxiv_https___arxiv_org_abs_2407_04534
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Introducing 'Inside' Out of Distribution
Lazebnik, Teddy
Machine Learning
Detecting and understanding out-of-distribution (OOD) samples is crucial in machine learning (ML) to ensure reliable model performance. Current OOD studies primarily focus on extrapolatory (outside) OOD, neglecting potential cases of interpolatory (inside) OOD. In this study, we introduce a novel perspective on OOD by suggesting it can be divided into inside and outside cases. We examine the inside-outside OOD profiles of datasets and their impact on ML model performance, using normalized Root Mean Squared Error (RMSE) and F1 score as the performance metrics on syntetically-generated datasets with both inside and outside OOD. Our analysis demonstrates that different inside-outside OOD profiles lead to unique effects on ML model performance, with outside OOD generally causing greater performance degradation, on average. These findings highlight the importance of distinguishing between inside and outside OOD for developing effective counter-OOD methods.
title Introducing 'Inside' Out of Distribution
topic Machine Learning
url https://arxiv.org/abs/2407.04534