Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Meni, Mackenzie J., White, Ryan T., Mayo, Michael, Pilkiewicz, Kevin
Format:	Preprint
Published:	2023
Subjects:	Computer Vision and Pattern Recognition Machine Learning Image and Video Processing
Online Access:	https://arxiv.org/abs/2308.14938
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929408624820224
author	Meni, Mackenzie J. White, Ryan T. Mayo, Michael Pilkiewicz, Kevin
author_facet	Meni, Mackenzie J. White, Ryan T. Mayo, Michael Pilkiewicz, Kevin
contents	Neural networks have dramatically increased our capacity to learn from large, high-dimensional datasets across innumerable disciplines. However, their decisions are not easily interpretable, their computational costs are high, and building and training them are not straightforward processes. To add structure to these efforts, we derive new mathematical results to efficiently measure the changes in entropy as fully-connected and convolutional neural networks process data. By measuring the change in entropy as networks process data effectively, patterns critical to a well-performing network can be visualized and identified. Entropy-based loss terms are developed to improve dense and convolutional model accuracy and efficiency by promoting the ideal entropy patterns. Experiments in image compression, image classification, and image segmentation on benchmark datasets demonstrate these losses guide neural networks to learn rich latent data representations in fewer dimensions, converge in fewer training epochs, and achieve higher accuracy.
format	Preprint
id	arxiv_https___arxiv_org_abs_2308_14938
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance Meni, Mackenzie J. White, Ryan T. Mayo, Michael Pilkiewicz, Kevin Computer Vision and Pattern Recognition Machine Learning Image and Video Processing Neural networks have dramatically increased our capacity to learn from large, high-dimensional datasets across innumerable disciplines. However, their decisions are not easily interpretable, their computational costs are high, and building and training them are not straightforward processes. To add structure to these efforts, we derive new mathematical results to efficiently measure the changes in entropy as fully-connected and convolutional neural networks process data. By measuring the change in entropy as networks process data effectively, patterns critical to a well-performing network can be visualized and identified. Entropy-based loss terms are developed to improve dense and convolutional model accuracy and efficiency by promoting the ideal entropy patterns. Experiments in image compression, image classification, and image segmentation on benchmark datasets demonstrate these losses guide neural networks to learn rich latent data representations in fewer dimensions, converge in fewer training epochs, and achieve higher accuracy.
title	Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance
topic	Computer Vision and Pattern Recognition Machine Learning Image and Video Processing
url	https://arxiv.org/abs/2308.14938

Similar Items