Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Bourgeois, Daniel, Ding, Zhimin, Jankov, Dimitrije, Li, Jiehui, Sleem, Mahmoud, Tang, Yuxin, Yao, Jiawen, Yao, Xinyu, Jermaine, Chris
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Distributed, Parallel, and Cluster Computing
Online-Zugang:	https://arxiv.org/abs/2410.02682
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866916422265864192
author	Bourgeois, Daniel Ding, Zhimin Jankov, Dimitrije Li, Jiehui Sleem, Mahmoud Tang, Yuxin Yao, Jiawen Yao, Xinyu Jermaine, Chris
author_facet	Bourgeois, Daniel Ding, Zhimin Jankov, Dimitrije Li, Jiehui Sleem, Mahmoud Tang, Yuxin Yao, Jiawen Yao, Xinyu Jermaine, Chris
contents	We consider the problem of automatically decomposing operations over tensors or arrays so that they can be executed in parallel on multiple devices. We address two, closely-linked questions. First, what programming abstraction should systems for tensor-based computing offer to enable such decompositions? Second, given that abstraction, how should such systems automatically decompose a tensor-based computation? We assert that tensor-based systems should offer a programming abstraction based on an extended Einstein summation notation, which is a fully declarative, mathematical specification for tensor computations. We show that any computation specified in the Einstein summation notation can be re-written into an equivalent tensor-relational computation, and this re-write generalizes existing notations of tensor parallelism such as "data parallel'' and "model parallel.'' We consider the algorithmic problem of optimally computing a tensor-relational decomposition of a graph of operations specified in our extended Einstein summation notation, and we experimentally show the value of the algorithm that we develop.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_02682
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution Bourgeois, Daniel Ding, Zhimin Jankov, Dimitrije Li, Jiehui Sleem, Mahmoud Tang, Yuxin Yao, Jiawen Yao, Xinyu Jermaine, Chris Distributed, Parallel, and Cluster Computing We consider the problem of automatically decomposing operations over tensors or arrays so that they can be executed in parallel on multiple devices. We address two, closely-linked questions. First, what programming abstraction should systems for tensor-based computing offer to enable such decompositions? Second, given that abstraction, how should such systems automatically decompose a tensor-based computation? We assert that tensor-based systems should offer a programming abstraction based on an extended Einstein summation notation, which is a fully declarative, mathematical specification for tensor computations. We show that any computation specified in the Einstein summation notation can be re-written into an equivalent tensor-relational computation, and this re-write generalizes existing notations of tensor parallelism such as "data parallel'' and "model parallel.'' We consider the algorithmic problem of optimally computing a tensor-relational decomposition of a graph of operations specified in our extended Einstein summation notation, and we experimentally show the value of the algorithm that we develop.
title	EinDecomp: Decomposition of Declaratively-Specified Machine Learning and Numerical Computations for Parallel Execution
topic	Distributed, Parallel, and Cluster Computing
url	https://arxiv.org/abs/2410.02682

Ähnliche Einträge