Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Song, Ruizhuo, Yuan, Beiming
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.03190
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908281800228864
author	Song, Ruizhuo Yuan, Beiming
author_facet	Song, Ruizhuo Yuan, Beiming
contents	This paper introduces innovative frameworks for visual abstract reasoning, aiming to boost deep learning model performance. It emphasizes the importance of separating abstract concept and reasoning feature extraction processes. The effectiveness of the Cross-Feature Network (CFN) and its enhanced version, Triple-CFN, validates this approach. Challenges in visual abstract reasoning arise from complex pattern induction and conflicts in low-dimensional representations. To address these, a dual Expectation-Maximization (EM) process is introduced during CFN training, optimizing module parameters to synthesize non-conflicting concepts. However, the dual EM process may overfit, so mutual and decorrelation supervisions are designed to assist feature extraction, with decorrelation supervision proving effective. Leveraging metadata in Raven's Progressive Matrices (RPM), the paper proposes Meta Triple-CFN, improving reasoning accuracy and interpretability. Additionally, a Re-space layer is designed for feature space construction, further enhancing Triple-CFN's reasoning accuracy. These innovative designs provide effective solutions for abstract reasoning problem solvers, benefiting multiple deep learning domains. Codes are available at: https://github.com/Yuanbeiming/Triple-CFN-Separating-Concepts-and-Features-Enhances-Machine-Abstract-Reasoning-Ability.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_03190
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Triple-CFN: Separating Concepts and Features Enhances Machine Abstract Reasoning Ability Song, Ruizhuo Yuan, Beiming Computer Vision and Pattern Recognition This paper introduces innovative frameworks for visual abstract reasoning, aiming to boost deep learning model performance. It emphasizes the importance of separating abstract concept and reasoning feature extraction processes. The effectiveness of the Cross-Feature Network (CFN) and its enhanced version, Triple-CFN, validates this approach. Challenges in visual abstract reasoning arise from complex pattern induction and conflicts in low-dimensional representations. To address these, a dual Expectation-Maximization (EM) process is introduced during CFN training, optimizing module parameters to synthesize non-conflicting concepts. However, the dual EM process may overfit, so mutual and decorrelation supervisions are designed to assist feature extraction, with decorrelation supervision proving effective. Leveraging metadata in Raven's Progressive Matrices (RPM), the paper proposes Meta Triple-CFN, improving reasoning accuracy and interpretability. Additionally, a Re-space layer is designed for feature space construction, further enhancing Triple-CFN's reasoning accuracy. These innovative designs provide effective solutions for abstract reasoning problem solvers, benefiting multiple deep learning domains. Codes are available at: https://github.com/Yuanbeiming/Triple-CFN-Separating-Concepts-and-Features-Enhances-Machine-Abstract-Reasoning-Ability.
title	Triple-CFN: Separating Concepts and Features Enhances Machine Abstract Reasoning Ability
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2403.03190

Similar Items