Physics guided dual Self-supervised learning for structure-based materials property prediction
Abstract
Deep learning (DL) models have now been widely used for high-performance material property prediction for properties such as formation energy and band gap. However, training such DL models usually requires a large amount of labeled data, which is usually not available for most materials properties such as exfoliation energy and elastic properties. Self-supervised learning (SSL) methods have been proposed to address this data scarcity issue by learning inherent representations from unlabeled data in various research fields. Herein, we present DSSL, a physics-guided Dual SSL framework, for graph neural networks (GNNs) based material property prediction. This hybrid framework combines node-masking based predictive SSL with atomic coordinate perturbation based contrastive SSL strategies, allowing it to learn structural embeddings that capture both local and global information of input crystals. Especially, we propose to use predicting the macroproperty (e.g. elasticity) related microproperty such as atomic stiffness as an additional pretext task to achieve physics-guided pretraining process. We pretrain our DSSL model on the Materials Project database with unlabeled data and finetune it with ten extra datasets with different material properties. The experimental results demonstrate that teaching neural networks some physics using the SSL strategy can bring up to 26.89\% performance improvement compared to the baseline GNN models. Our source code is now freely available at https://github.com/usccolumbia/DSSL
- Publication:
-
arXiv e-prints
- Pub Date:
- January 2024
- DOI:
- arXiv:
- arXiv:2401.05223
- Bibcode:
- 2024arXiv240105223F
- Keywords:
-
- Condensed Matter - Materials Science
- E-Print:
- 11 pages