Representations of Materials for Machine Learning
Abstract
High-throughput data generation methods and machine learning (ML) algorithms have given rise to a new era of computational materials science by learning the relations between composition, structure, and properties and by exploiting such relations for design. However, to build these connections, materials data must be translated into a numerical form, called a representation, that can be processed by an ML model. Data sets in materials science vary in format (ranging from images to spectra), size, and fidelity. Predictive models vary in scope and properties of interest. Here, we review context-dependent strategies for constructing representations that enable the use of materials as inputs or outputs for ML models. Furthermore, we discuss how modern ML techniques can learn representations from data and transfer chemical and physical information between tasks. Finally, we outline high-impact questions that have not been fully resolved and thus require further investigation.
- Publication:
-
Annual Review of Materials Research
- Pub Date:
- July 2023
- DOI:
- 10.1146/annurev-matsci-080921-085947
- arXiv:
- arXiv:2301.08813
- Bibcode:
- 2023AnRMS..53..399D
- Keywords:
-
- Condensed Matter - Materials Science
- E-Print:
- 20 pages, 5 figures, To Appear in Annual Review of Materials Research 53