A Survey of Pretraining on Graphs: Taxonomy, Methods, and Applications
Abstract
Pretrained Language Models (PLMs) such as BERT have revolutionized the landscape of Natural Language Processing (NLP). Inspired by their proliferation, tremendous efforts have been devoted to Pretrained Graph Models (PGMs). Owing to the powerful model architectures of PGMs, abundant knowledge from massive labeled and unlabeled graph data can be captured. The knowledge implicitly encoded in model parameters can benefit various downstream tasks and help to alleviate several fundamental issues of learning on graphs. In this paper, we provide the first comprehensive survey for PGMs. We firstly present the limitations of graph representation learning and thus introduce the motivation for graph pre-training. Then, we systematically categorize existing PGMs based on a taxonomy from four different perspectives. Next, we present the applications of PGMs in social recommendation and drug discovery. Finally, we outline several promising research directions that can serve as a guideline for future research.
- Publication:
-
arXiv e-prints
- Pub Date:
- February 2022
- DOI:
- 10.48550/arXiv.2202.07893
- arXiv:
- arXiv:2202.07893
- Bibcode:
- 2022arXiv220207893X
- Keywords:
-
- Computer Science - Machine Learning;
- Computer Science - Social and Information Networks;
- Quantitative Biology - Biomolecules
- E-Print:
- 9 pages. Submitted to IJCAI 2022 (Survey Track)