Bibliographic Details
Main Authors:	Hu, Zhengyang; Kang, Song; Zeng, Qunsong; Huang, Kaibin; Yang, Yanchao
Format:	Preprint
Published:	2024
Subjects:	Information Theory
Online Access:	https://arxiv.org/abs/2402.10158

_version_	1866913235223969792
author	Hu, Zhengyang; Kang, Song; Zeng, Qunsong; Huang, Kaibin; Yang, Yanchao
author_facet	Hu, Zhengyang; Kang, Song; Zeng, Qunsong; Huang, Kaibin; Yang, Yanchao
contents	Estimating mutual correlations between random variables or data streams is essential for intelligent behavior and decision-making. As a fundamental quantity for measuring statistical relationships, mutual information has been extensively studied and utilized for its generality and equitability. However, existing methods often lack either the efficiency needed for real-time applications (for example, methods that rely on test-time optimization of a neural network) or the differentiability required for end-to-end learning (for example, histogram-based estimators). We introduce a neural network called InfoNet, which directly outputs mutual information estimates for data streams by leveraging the attention mechanism and the computational efficiency of deep learning infrastructures. By maximizing a dual formulation of mutual information through large-scale simulated training, our approach circumvents time-consuming test-time optimization and offers generalization ability. We evaluate the effectiveness and generalization of our proposed mutual information estimation scheme on various families of distributions and applications. Our results demonstrate that InfoNet and its training process provide a graceful efficiency-accuracy trade-off and order-preserving properties. We will make the code and models available as a comprehensive toolbox to facilitate studies in different fields requiring real-time mutual information estimation.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_10158
institution	arXiv
publishDate	2024
record_format	arxiv
title	InfoNet: Neural Estimation of Mutual Information without Test-Time Optimization
topic	Information Theory
url	https://arxiv.org/abs/2402.10158
