Saved in:
Bibliographic Details
Main Authors: Meni, Mackenzie J., White, Ryan T., Mayo, Michael, Pilkiewicz, Kevin
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2308.14938
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929408624820224
author Meni, Mackenzie J.
White, Ryan T.
Mayo, Michael
Pilkiewicz, Kevin
author_facet Meni, Mackenzie J.
White, Ryan T.
Mayo, Michael
Pilkiewicz, Kevin
contents Neural networks have dramatically increased our capacity to learn from large, high-dimensional datasets across innumerable disciplines. However, their decisions are not easily interpretable, their computational costs are high, and building and training them are not straightforward processes. To add structure to these efforts, we derive new mathematical results to efficiently measure the changes in entropy as fully-connected and convolutional neural networks process data. By measuring the change in entropy as networks process data effectively, patterns critical to a well-performing network can be visualized and identified. Entropy-based loss terms are developed to improve dense and convolutional model accuracy and efficiency by promoting the ideal entropy patterns. Experiments in image compression, image classification, and image segmentation on benchmark datasets demonstrate these losses guide neural networks to learn rich latent data representations in fewer dimensions, converge in fewer training epochs, and achieve higher accuracy.
format Preprint
id arxiv_https___arxiv_org_abs_2308_14938
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance
Meni, Mackenzie J.
White, Ryan T.
Mayo, Michael
Pilkiewicz, Kevin
Computer Vision and Pattern Recognition
Machine Learning
Image and Video Processing
Neural networks have dramatically increased our capacity to learn from large, high-dimensional datasets across innumerable disciplines. However, their decisions are not easily interpretable, their computational costs are high, and building and training them are not straightforward processes. To add structure to these efforts, we derive new mathematical results to efficiently measure the changes in entropy as fully-connected and convolutional neural networks process data. By measuring the change in entropy as networks process data effectively, patterns critical to a well-performing network can be visualized and identified. Entropy-based loss terms are developed to improve dense and convolutional model accuracy and efficiency by promoting the ideal entropy patterns. Experiments in image compression, image classification, and image segmentation on benchmark datasets demonstrate these losses guide neural networks to learn rich latent data representations in fewer dimensions, converge in fewer training epochs, and achieve higher accuracy.
title Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance
topic Computer Vision and Pattern Recognition
Machine Learning
Image and Video Processing
url https://arxiv.org/abs/2308.14938