Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Nai, Ruiqian, Wen, Zixin, Li, Ji, Li, Yuanzhi, Gao, Yang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2403.00352
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914698184622080
author	Nai, Ruiqian Wen, Zixin Li, Ji Li, Yuanzhi Gao, Yang
author_facet	Nai, Ruiqian Wen, Zixin Li, Ji Li, Yuanzhi Gao, Yang
contents	In representation learning, a disentangled representation is highly desirable as it encodes generative factors of data in a separable and compact pattern. Researchers have advocated leveraging disentangled representations to complete downstream tasks with encouraging empirical evidence. This paper further investigates the necessity of disentangled representation in downstream applications. Specifically, we show that dimension-wise disentangled representations are unnecessary on a fundamental downstream task, abstract visual reasoning. We provide extensive empirical evidence against the necessity of disentanglement, covering multiple datasets, representation learning methods, and downstream network architectures. Furthermore, our findings suggest that the informativeness of representations is a better indicator of downstream performance than disentanglement. Finally, the positive correlation between informativeness and disentanglement explains the claimed usefulness of disentangled representations in previous works. The source code is available at https://github.com/Richard-coder-Nai/disentanglement-lib-necessity.git.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_00352
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning Nai, Ruiqian Wen, Zixin Li, Ji Li, Yuanzhi Gao, Yang Computer Vision and Pattern Recognition Machine Learning In representation learning, a disentangled representation is highly desirable as it encodes generative factors of data in a separable and compact pattern. Researchers have advocated leveraging disentangled representations to complete downstream tasks with encouraging empirical evidence. This paper further investigates the necessity of disentangled representation in downstream applications. Specifically, we show that dimension-wise disentangled representations are unnecessary on a fundamental downstream task, abstract visual reasoning. We provide extensive empirical evidence against the necessity of disentanglement, covering multiple datasets, representation learning methods, and downstream network architectures. Furthermore, our findings suggest that the informativeness of representations is a better indicator of downstream performance than disentanglement. Finally, the positive correlation between informativeness and disentanglement explains the claimed usefulness of disentangled representations in previous works. The source code is available at https://github.com/Richard-coder-Nai/disentanglement-lib-necessity.git.
title	Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
topic	Computer Vision and Pattern Recognition Machine Learning
url	https://arxiv.org/abs/2403.00352

Similar Items