Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Bicici, Ufuk Can, Meral, Tuna Han Salih, Akarun, Lale
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2402.08345
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917714530926592
author	Bicici, Ufuk Can Meral, Tuna Han Salih Akarun, Lale
author_facet	Bicici, Ufuk Can Meral, Tuna Han Salih Akarun, Lale
contents	Conditional computing processes an input using only part of the neural network's computational units. Learning to execute parts of a deep convolutional network by routing individual samples has several advantages: Reducing the computational burden is an obvious advantage. Furthermore, if similar classes are routed to the same path, that part of the network learns to discriminate between finer differences and better classification accuracies can be attained with fewer parameters. Recently, several papers have exploited this idea to take a particular child of a node in a tree-shaped network or to skip parts of a network. In this work, we follow a Trellis-based approach for generating specific execution paths in a deep convolutional neural network. We have designed routing mechanisms that use differentiable information gain-based cost functions to determine which subset of features in a convolutional layer will be executed. We call our method Conditional Information Gain Trellis (CIGT). We show that our conditional execution mechanism achieves comparable or better model performance compared to unconditional baselines, using only a fraction of the computational resources.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_08345
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Conditional Information Gain Trellis Bicici, Ufuk Can Meral, Tuna Han Salih Akarun, Lale Computer Vision and Pattern Recognition Conditional computing processes an input using only part of the neural network's computational units. Learning to execute parts of a deep convolutional network by routing individual samples has several advantages: Reducing the computational burden is an obvious advantage. Furthermore, if similar classes are routed to the same path, that part of the network learns to discriminate between finer differences and better classification accuracies can be attained with fewer parameters. Recently, several papers have exploited this idea to take a particular child of a node in a tree-shaped network or to skip parts of a network. In this work, we follow a Trellis-based approach for generating specific execution paths in a deep convolutional neural network. We have designed routing mechanisms that use differentiable information gain-based cost functions to determine which subset of features in a convolutional layer will be executed. We call our method Conditional Information Gain Trellis (CIGT). We show that our conditional execution mechanism achieves comparable or better model performance compared to unconditional baselines, using only a fraction of the computational resources.
title	Conditional Information Gain Trellis
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2402.08345

Similar Items