Machine learning identified molecular fragments responsible for infrared emission features of polycyclic aromatic hydrocarbons
Abstract
Machine learning feature importance calculations are used to determine the molecular substructures that are responsible for mid- and far-infrared (IR) emission features of neutral polycyclic aromatic hydrocarbons (PAHs). Using the extended-connectivity fingerprint as a descriptor of chemical structure, a random forest model is trained on the spectra of 14 124 PAHs to evaluate the importance of 10 632 molecular fragments for each band within the range of 2.761 to $1172.745\, \mu$m. The accuracy of the results is confirmed by comparing them with previously studied unidentified infrared emission (UIE) bands. The results are summarized in two tables available as Supplementary Data, which can be used as a reference for assessing possible UIE carriers. We demonstrate that the tables can be used to explore the relation between the PAH structure and the spectra by discussing about the IR features of nitrogen-containing PAHs and superhydrogenated PAHs.
- Publication:
-
Monthly Notices of the Royal Astronomical Society
- Pub Date:
- October 2023
- DOI:
- 10.1093/mnrasl/slad089
- arXiv:
- arXiv:2307.08277
- Bibcode:
- 2023MNRAS.525L..29M
- Keywords:
-
- astronomical data bases: miscellaneous;
- software: data analysis;
- ISM: molecules;
- infrared: ISM;
- Astrophysics - Astrophysics of Galaxies;
- Astrophysics - Instrumentation and Methods for Astrophysics;
- Astrophysics - Solar and Stellar Astrophysics
- E-Print:
- MNRAS 525, L29-L35 (2023)