Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Becker, Marlon, Risse, Benjamin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2411.19640
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909408688078848
author	Becker, Marlon Risse, Benjamin
author_facet	Becker, Marlon Risse, Benjamin
contents	We empirically investigate the impact of learning randomly generated labels in parallel to class labels in supervised learning on memorization, model complexity, and generalization in deep neural networks. To this end, we introduce a multi-head network architecture as an extension of standard CNN architectures. Inspired by methods used in fair AI, our approach allows for the unlearning of random labels, preventing the network from memorizing individual samples. Based on the concept of Rademacher complexity, we first use our proposed method as a complexity metric to analyze the effects of common regularization techniques and challenge the traditional understanding of feature extraction and classification in CNNs. Second, we propose a novel regularizer that effectively reduces sample memorization. However, contrary to the predictions of classical statistical learning theory, we do not observe improvements in generalization.
format	Preprint
id	arxiv_https___arxiv_org_abs_2411_19640
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Learned Random Label Predictions as a Neural Network Complexity Metric Becker, Marlon Risse, Benjamin Machine Learning We empirically investigate the impact of learning randomly generated labels in parallel to class labels in supervised learning on memorization, model complexity, and generalization in deep neural networks. To this end, we introduce a multi-head network architecture as an extension of standard CNN architectures. Inspired by methods used in fair AI, our approach allows for the unlearning of random labels, preventing the network from memorizing individual samples. Based on the concept of Rademacher complexity, we first use our proposed method as a complexity metric to analyze the effects of common regularization techniques and challenge the traditional understanding of feature extraction and classification in CNNs. Second, we propose a novel regularizer that effectively reduces sample memorization. However, contrary to the predictions of classical statistical learning theory, we do not observe improvements in generalization.
title	Learned Random Label Predictions as a Neural Network Complexity Metric
topic	Machine Learning
url	https://arxiv.org/abs/2411.19640

Similar Items