Saved in:
Bibliographic Details
Main Authors: Yang, Wenhan, Hu, Zixuan, Lin, Lilang, Liu, Jiaying, Duan, Ling-Yu
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.01017
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916308649508864
author Yang, Wenhan
Hu, Zixuan
Lin, Lilang
Liu, Jiaying
Duan, Ling-Yu
author_facet Yang, Wenhan
Hu, Zixuan
Lin, Lilang
Liu, Jiaying
Duan, Ling-Yu
contents Coding, which targets compressing and reconstructing data, and intelligence, often regarded at an abstract computational level as being centered around model learning and prediction, interweave recently to give birth to a series of significant progress. The recent trends demonstrate the potential homogeneity of these two fields, especially when deep-learning models aid these two categories for better probability modeling. For better understanding and describing from a unified perspective, inspired by the basic generally recognized principles in cognitive psychology, we formulate a novel problem of Coding for Intelligence from the category theory view. Based on the three axioms: existence of ideal coding, existence of practical coding, and compactness promoting generalization, we derive a general framework to understand existing methodologies, namely that, coding captures the intrinsic relationships of objects as much as possible, while ignoring information irrelevant to downstream tasks. This framework helps identify the challenges and essential elements in solving the specific derived Minimal Description Length (MDL) optimization problem from a broader range, providing opportunities to build a more intelligent system for handling multiple tasks/applications with coding ideas/tools. Centering on those elements, we systematically review recent processes of towards optimizing the MDL problem in more comprehensive ways from data, model, and task perspectives, and reveal their impacts on the potential CfI technical routes. After that, we also present new technique paths to fulfill CfI and provide potential solutions with preliminary experimental evidence. Last, further directions and remaining issues are discussed as well. The discussion shows our theory can reveal many phenomena and insights about large foundation models, which mutually corroborate with recent practices in feature learning.
format Preprint
id arxiv_https___arxiv_org_abs_2407_01017
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Coding for Intelligence from the Perspective of Category
Yang, Wenhan
Hu, Zixuan
Lin, Lilang
Liu, Jiaying
Duan, Ling-Yu
Computer Vision and Pattern Recognition
Coding, which targets compressing and reconstructing data, and intelligence, often regarded at an abstract computational level as being centered around model learning and prediction, interweave recently to give birth to a series of significant progress. The recent trends demonstrate the potential homogeneity of these two fields, especially when deep-learning models aid these two categories for better probability modeling. For better understanding and describing from a unified perspective, inspired by the basic generally recognized principles in cognitive psychology, we formulate a novel problem of Coding for Intelligence from the category theory view. Based on the three axioms: existence of ideal coding, existence of practical coding, and compactness promoting generalization, we derive a general framework to understand existing methodologies, namely that, coding captures the intrinsic relationships of objects as much as possible, while ignoring information irrelevant to downstream tasks. This framework helps identify the challenges and essential elements in solving the specific derived Minimal Description Length (MDL) optimization problem from a broader range, providing opportunities to build a more intelligent system for handling multiple tasks/applications with coding ideas/tools. Centering on those elements, we systematically review recent processes of towards optimizing the MDL problem in more comprehensive ways from data, model, and task perspectives, and reveal their impacts on the potential CfI technical routes. After that, we also present new technique paths to fulfill CfI and provide potential solutions with preliminary experimental evidence. Last, further directions and remaining issues are discussed as well. The discussion shows our theory can reveal many phenomena and insights about large foundation models, which mutually corroborate with recent practices in feature learning.
title Coding for Intelligence from the Perspective of Category
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2407.01017