Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jiang, RuiZhe, Lei, Haotian
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2412.01476
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929685291597824
author	Jiang, RuiZhe Lei, Haotian
author_facet	Jiang, RuiZhe Lei, Haotian
contents	Over-parameterized neural network models often lead to significant performance discrepancies between training and test sets, a phenomenon known as overfitting. To address this, researchers have proposed numerous regularization techniques tailored to various tasks and model architectures. In this paper, we introduce a simple perspective on overfitting: models learn different representations in different i.i.d. datasets. Based on this viewpoint, we propose an adaptive method, ConsistentFeature, that regularizes the model by constraining feature differences across random subsets of the same training set. Due to minimal prior assumptions, this approach is applicable to almost any architecture and task. Our experiments show that it effectively reduces overfitting, with low sensitivity to hyperparameters and minimal computational cost. It demonstrates particularly strong memory suppression and promotes normal convergence, even when the model has already started to overfit. Even in the absence of significant overfitting, our method consistently improves accuracy and reduces validation loss.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_01476
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	ConsistentFeature: A Plug-and-Play Component for Neural Network Regularization Jiang, RuiZhe Lei, Haotian Machine Learning Over-parameterized neural network models often lead to significant performance discrepancies between training and test sets, a phenomenon known as overfitting. To address this, researchers have proposed numerous regularization techniques tailored to various tasks and model architectures. In this paper, we introduce a simple perspective on overfitting: models learn different representations in different i.i.d. datasets. Based on this viewpoint, we propose an adaptive method, ConsistentFeature, that regularizes the model by constraining feature differences across random subsets of the same training set. Due to minimal prior assumptions, this approach is applicable to almost any architecture and task. Our experiments show that it effectively reduces overfitting, with low sensitivity to hyperparameters and minimal computational cost. It demonstrates particularly strong memory suppression and promotes normal convergence, even when the model has already started to overfit. Even in the absence of significant overfitting, our method consistently improves accuracy and reduces validation loss.
title	ConsistentFeature: A Plug-and-Play Component for Neural Network Regularization
topic	Machine Learning
url	https://arxiv.org/abs/2412.01476

Similar Items