Saved in:
Bibliographic Details
Main Authors: Chen, Louis L., Chern, Bobbie, Eckstrand, Eric, Mahapatra, Amogh, Royset, Johannes O.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2405.20531
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912232566161408
author Chen, Louis L.
Chern, Bobbie
Eckstrand, Eric
Mahapatra, Amogh
Royset, Johannes O.
author_facet Chen, Louis L.
Chern, Bobbie
Eckstrand, Eric
Mahapatra, Amogh
Royset, Johannes O.
contents Labeling errors in datasets are common, arising in a variety of contexts, such as human labeling, noisy labeling, and weak labeling (i.e., image classification). Although neural networks (NNs) can tolerate modest amounts of these errors, their performance degrades substantially once error levels exceed a certain threshold. We propose a new loss reweighting, architecture-independent methodology, Rockafellian Relaxation Method (RRM) for neural network training. Experiments indicate RRM can enhance neural network methods to achieve robust performance across classification tasks in computer vision and natural language processing (sentiment analysis). We find that RRM can mitigate the effects of dataset contamination stemming from both (heavy) labeling error and/or adversarial perturbation, demonstrating effectiveness across a variety of data domains and machine learning tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2405_20531
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation
Chen, Louis L.
Chern, Bobbie
Eckstrand, Eric
Mahapatra, Amogh
Royset, Johannes O.
Machine Learning
Labeling errors in datasets are common, arising in a variety of contexts, such as human labeling, noisy labeling, and weak labeling (i.e., image classification). Although neural networks (NNs) can tolerate modest amounts of these errors, their performance degrades substantially once error levels exceed a certain threshold. We propose a new loss reweighting, architecture-independent methodology, Rockafellian Relaxation Method (RRM) for neural network training. Experiments indicate RRM can enhance neural network methods to achieve robust performance across classification tasks in computer vision and natural language processing (sentiment analysis). We find that RRM can mitigate the effects of dataset contamination stemming from both (heavy) labeling error and/or adversarial perturbation, demonstrating effectiveness across a variety of data domains and machine learning tasks.
title Mitigating the Impact of Labeling Errors on Training via Rockafellian Relaxation
topic Machine Learning
url https://arxiv.org/abs/2405.20531