Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lai, Zhao-Rong, Wang, Weiwen
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2405.01389
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929346761981952
author	Lai, Zhao-Rong Wang, Weiwen
author_facet	Lai, Zhao-Rong Wang, Weiwen
contents	Invariant risk minimization (IRM) is an arising approach to generalize invariant features to different environments in machine learning. While most related works focus on new IRM settings or new application scenarios, the mathematical essence of IRM remains to be properly explained. We verify that IRM is essentially a total variation based on $L^2$ norm (TV-$\ell_2$) of the learning risk with respect to the classifier variable. Moreover, we propose a novel IRM framework based on the TV-$\ell_1$ model. It not only expands the classes of functions that can be used as the learning risk and the feature extractor, but also has robust performance in denoising and invariant feature preservation based on the coarea formula. We also illustrate some requirements for IRM-TV-$\ell_1$ to achieve out-of-distribution generalization. Experimental results show that the proposed framework achieves competitive performance in several benchmark machine learning scenarios.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_01389
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Invariant Risk Minimization Is A Total Variation Model Lai, Zhao-Rong Wang, Weiwen Machine Learning Invariant risk minimization (IRM) is an arising approach to generalize invariant features to different environments in machine learning. While most related works focus on new IRM settings or new application scenarios, the mathematical essence of IRM remains to be properly explained. We verify that IRM is essentially a total variation based on $L^2$ norm (TV-$\ell_2$) of the learning risk with respect to the classifier variable. Moreover, we propose a novel IRM framework based on the TV-$\ell_1$ model. It not only expands the classes of functions that can be used as the learning risk and the feature extractor, but also has robust performance in denoising and invariant feature preservation based on the coarea formula. We also illustrate some requirements for IRM-TV-$\ell_1$ to achieve out-of-distribution generalization. Experimental results show that the proposed framework achieves competitive performance in several benchmark machine learning scenarios.
title	Invariant Risk Minimization Is A Total Variation Model
topic	Machine Learning
url	https://arxiv.org/abs/2405.01389

Similar Items