Simulating Execution Time of Tensor Programs using Graph Neural Networks

doi:10.48550/arXiv.1904.11876

Simulating Execution Time of Tensor Programs using Graph Neural Networks

Optimizing the execution time of tensor program, e.g., a convolution, involves finding its optimal configuration. Searching the configuration space exhaustively is typically infeasible in practice. In line with recent research using TVM, we propose to learn a surrogate model to overcome this issue. The model is trained on an acyclic graph called an abstract syntax tree, and utilizes a graph convolutional network to exploit structure in the graph. We claim that a learnable graph-based data processing is a strong competitor to heuristic-based feature extraction. We present a new dataset of graphs corresponding to configurations and their execution time for various tensor programs. We provide baselines for a runtime prediction task.

Publication:

arXiv e-prints

Pub Date:

April 2019

DOI:

10.48550/arXiv.1904.11876

arXiv:

arXiv:1904.11876

Bibcode:

2019arXiv190411876T

Keywords:

Statistics - Machine Learning;
Computer Science - Machine Learning

E-Print:

All authors contributed equally. Accepted as a workshop paper at Representation Learning on Graphs and Manifolds @ ICLR 2019. Fixed values in Table 1

NASA/ADS

Simulating Execution Time of Tensor Programs using Graph Neural Networks

Abstract