Saved in:
Bibliographic Details
Main Authors: Jahromi, Saeed S., Orus, Roman
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2211.14657
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913470151131136
author Jahromi, Saeed S.
Orus, Roman
author_facet Jahromi, Saeed S.
Orus, Roman
contents Deep neural networks (NNs) encounter scalability limitations when confronted with a vast array of neurons, thereby constraining their achievable network depth. To address this challenge, we propose an integration of tensor networks (TN) into NN frameworks, combined with a variational DMRG-inspired training technique. This in turn, results in a scalable tensor neural network (TNN) architecture capable of efficient training over a large parameter space. Our variational algorithm utilizes a local gradient-descent technique, enabling manual or automatic computation of tensor gradients, facilitating design of hybrid TNN models with combined dense and tensor layers. Our training algorithm further provides insight on the entanglement structure of the tensorized trainable weights and correlation among the model parameters. We validate the accuracy and efficiency of our method by designing TNN models and providing benchmark results for linear and non-linear regressions, data classification and image recognition on MNIST handwritten digits.
format Preprint
id arxiv_https___arxiv_org_abs_2211_14657
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Variational Tensor Neural Networks for Deep Learning
Jahromi, Saeed S.
Orus, Roman
Disordered Systems and Neural Networks
Quantum Physics
Deep neural networks (NNs) encounter scalability limitations when confronted with a vast array of neurons, thereby constraining their achievable network depth. To address this challenge, we propose an integration of tensor networks (TN) into NN frameworks, combined with a variational DMRG-inspired training technique. This in turn, results in a scalable tensor neural network (TNN) architecture capable of efficient training over a large parameter space. Our variational algorithm utilizes a local gradient-descent technique, enabling manual or automatic computation of tensor gradients, facilitating design of hybrid TNN models with combined dense and tensor layers. Our training algorithm further provides insight on the entanglement structure of the tensorized trainable weights and correlation among the model parameters. We validate the accuracy and efficiency of our method by designing TNN models and providing benchmark results for linear and non-linear regressions, data classification and image recognition on MNIST handwritten digits.
title Variational Tensor Neural Networks for Deep Learning
topic Disordered Systems and Neural Networks
Quantum Physics
url https://arxiv.org/abs/2211.14657