Searching molecular structure databases with tandem mass spectra using CSI:FingerID
Abstract
Untargeted metabolomics experiments usually rely on tandem MS (MS/MS) to identify the thousands of compounds in a biological sample. Today, the vast majority of metabolites remain unknown. Recently, several computational approaches were presented for searching molecular structure databases using MS/MS data. Here, we present CSI:FingerID, which combines fragmentation tree computation and machine learning. An in-depth evaluation on two large-scale datasets shows that our method can find 150% more correct identifications than the second-best search method. In comparison with the two runner-up methods, CSI:FingerID reaches 5.4-fold more unique identifications. We also present evaluations indicating that the performance of our method will further improve when more training data become available. CSI:FingerID is publicly available at www.csi-fingerid.org.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- October 2015
- DOI:
- 10.1073/pnas.1509788112
- Bibcode:
- 2015PNAS..11212580D