Staff View: Generalization Analysis for Deep Contrastive Representation Learning

Bibliographic Details
Main Authors:	Hieu, Nong Minh; Ledent, Antoine; Lei, Yunwen; Ku, Cheng Yeaw
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2412.12014

_version_	1866916532246806528
author	Hieu, Nong Minh; Ledent, Antoine; Lei, Yunwen; Ku, Cheng Yeaw
author_facet	Hieu, Nong Minh; Ledent, Antoine; Lei, Yunwen; Ku, Cheng Yeaw
contents	In this paper, we present generalization bounds for the unsupervised risk in the Deep Contrastive Representation Learning framework, which employs deep neural networks as representation functions. We approach this problem from two angles. On the one hand, we derive a parameter-counting bound that scales with the overall size of the neural networks. On the other hand, we provide a norm-based bound that scales with the norms of the neural networks' weight matrices. Ignoring logarithmic factors, the bounds are independent of $k$, the size of the tuples provided for contrastive learning. To the best of our knowledge, this property is shared by only one other work, which employed a different proof strategy and suffers from a very strong exponential dependence on the depth of the network, due to its use of the peeling technique. Our results circumvent this by leveraging powerful results on covering numbers with respect to uniform norms over samples. In addition, we utilize loss augmentation techniques to further reduce the dependency on matrix norms and the implicit dependence on network depth. In fact, our techniques allow us to produce many bounds for the contrastive learning setting with architectural dependencies similar to those in the study of the sample complexity of ordinary loss functions, thereby bridging the gap between the learning theories of contrastive learning and DNNs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_12014
institution	arXiv
publishDate	2024
record_format	arxiv
title	Generalization Analysis for Deep Contrastive Representation Learning
topic	Machine Learning
url	https://arxiv.org/abs/2412.12014

Similar Items