Saved in:
Bibliographic Details
Main Authors: Nai, Ruiqian, Wen, Zixin, Li, Ji, Li, Yuanzhi, Gao, Yang
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.00352
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914698184622080
author Nai, Ruiqian
Wen, Zixin
Li, Ji
Li, Yuanzhi
Gao, Yang
author_facet Nai, Ruiqian
Wen, Zixin
Li, Ji
Li, Yuanzhi
Gao, Yang
contents In representation learning, a disentangled representation is highly desirable as it encodes generative factors of data in a separable and compact pattern. Researchers have advocated leveraging disentangled representations to complete downstream tasks with encouraging empirical evidence. This paper further investigates the necessity of disentangled representation in downstream applications. Specifically, we show that dimension-wise disentangled representations are unnecessary on a fundamental downstream task, abstract visual reasoning. We provide extensive empirical evidence against the necessity of disentanglement, covering multiple datasets, representation learning methods, and downstream network architectures. Furthermore, our findings suggest that the informativeness of representations is a better indicator of downstream performance than disentanglement. Finally, the positive correlation between informativeness and disentanglement explains the claimed usefulness of disentangled representations in previous works. The source code is available at https://github.com/Richard-coder-Nai/disentanglement-lib-necessity.git.
format Preprint
id arxiv_https___arxiv_org_abs_2403_00352
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Nai, Ruiqian
Wen, Zixin
Li, Ji
Li, Yuanzhi
Gao, Yang
Computer Vision and Pattern Recognition
Machine Learning
In representation learning, a disentangled representation is highly desirable as it encodes generative factors of data in a separable and compact pattern. Researchers have advocated leveraging disentangled representations to complete downstream tasks with encouraging empirical evidence. This paper further investigates the necessity of disentangled representation in downstream applications. Specifically, we show that dimension-wise disentangled representations are unnecessary on a fundamental downstream task, abstract visual reasoning. We provide extensive empirical evidence against the necessity of disentanglement, covering multiple datasets, representation learning methods, and downstream network architectures. Furthermore, our findings suggest that the informativeness of representations is a better indicator of downstream performance than disentanglement. Finally, the positive correlation between informativeness and disentanglement explains the claimed usefulness of disentangled representations in previous works. The source code is available at https://github.com/Richard-coder-Nai/disentanglement-lib-necessity.git.
title Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2403.00352