A Survey of Pretraining on Graphs: Taxonomy, Methods, and Applications

doi:10.48550/arXiv.2202.07893

A Survey of Pretraining on Graphs: Taxonomy, Methods, and Applications

Pretrained Language Models (PLMs) such as BERT have revolutionized the landscape of Natural Language Processing (NLP). Inspired by their proliferation, tremendous efforts have been devoted to Pretrained Graph Models (PGMs). Owing to the powerful model architectures of PGMs, abundant knowledge from massive labeled and unlabeled graph data can be captured. The knowledge implicitly encoded in model parameters can benefit various downstream tasks and help to alleviate several fundamental issues of learning on graphs. In this paper, we provide the first comprehensive survey for PGMs. We firstly present the limitations of graph representation learning and thus introduce the motivation for graph pre-training. Then, we systematically categorize existing PGMs based on a taxonomy from four different perspectives. Next, we present the applications of PGMs in social recommendation and drug discovery. Finally, we outline several promising research directions that can serve as a guideline for future research.

Publication:

arXiv e-prints

Pub Date:

February 2022

DOI:

10.48550/arXiv.2202.07893

arXiv:

arXiv:2202.07893

Bibcode:

2022arXiv220207893X

Keywords:

Computer Science - Machine Learning;
Computer Science - Social and Information Networks;
Quantitative Biology - Biomolecules

E-Print:

9 pages. Submitted to IJCAI 2022 (Survey Track)

NASA/ADS

A Survey of Pretraining on Graphs: Taxonomy, Methods, and Applications

Abstract