Synthetic Graph Generation to Benchmark Graph Learning

doi:10.48550/arXiv.2204.01376


Graph learning algorithms have attained state-of-the-art performance on many graph analysis tasks such as node classification, link prediction, and clustering. It has, however, become hard to track the field's burgeoning progress. One reason is the very small number of datasets used in practice to benchmark the performance of graph learning algorithms. This shockingly small sample size (~10) allows for only limited scientific insight into the problem. In this work, we aim to address this deficiency. We propose to generate synthetic graphs and study the behaviour of graph learning algorithms in a controlled scenario. We develop a fully featured synthetic graph generator that allows deep inspection of different models. We argue that synthetic graph generation allows for thorough investigation of algorithms and provides more insight than overfitting on three citation datasets. In a case study, we show how our framework provides insight into unsupervised and supervised graph neural network models.
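As a rough illustration of what generating a graph under controlled conditions can look like, the sketch below samples a stochastic block model with class-dependent Gaussian node features. The abstract does not describe the generator's design, so the model choice, the function name `generate_sbm_graph`, and every parameter here are assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def generate_sbm_graph(n_nodes=200, n_classes=4, p_in=0.10, p_out=0.01,
                       feature_dim=8, center_scale=2.0, seed=0):
    """Sample an undirected graph with planted classes (hypothetical helper).

    Returns (adjacency, features, labels):
      adjacency: (n_nodes, n_nodes) symmetric 0/1 matrix, zero diagonal
      features:  (n_nodes, feature_dim) Gaussian features around class centers
      labels:    (n_nodes,) integer class of each node
    """
    rng = np.random.default_rng(seed)

    # Assign each node to a class uniformly at random.
    labels = rng.integers(0, n_classes, size=n_nodes)

    # Edge probability is p_in within a class and p_out across classes.
    same_class = labels[:, None] == labels[None, :]
    probs = np.where(same_class, p_in, p_out)

    # Sample only the strict upper triangle, then mirror it so the graph
    # is undirected and has no self-loops.
    upper = np.triu(rng.random((n_nodes, n_nodes)) < probs, k=1)
    adjacency = (upper | upper.T).astype(np.int8)

    # Features: one random center per class plus unit-variance noise.
    centers = rng.normal(0.0, center_scale, size=(n_classes, feature_dim))
    features = centers[labels] + rng.normal(size=(n_nodes, feature_dim))

    return adjacency, features, labels


if __name__ == "__main__":
    A, X, y = generate_sbm_graph()
    print(A.shape, X.shape, y.shape)
```

In a setup like this, changing the ratio of `p_in` to `p_out` controls how informative the graph structure is, while `center_scale` controls how informative the features are, so each signal can be varied independently when comparing models.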

Publication: arXiv e-prints
Pub Date: April 2022
DOI: 10.48550/arXiv.2204.01376
arXiv: arXiv:2204.01376
Bibcode: 2022arXiv220401376T
Keywords: Computer Science - Machine Learning; Computer Science - Social and Information Networks
E-Print: 4 pages. Appeared at the GLB'21 workshop

