Epistatic models predict mutable sites in SARS-CoV-2 proteins and epitopes
Abstract
During the COVID pandemic, new severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants emerge and spread, some being of major concern due to their increased infectivity or capacity to reduce vaccine efficiency. Anticipating mutations, which might give rise to new variants, would be of great interest. We construct sequence models predicting how mutable SARS-CoV-2 positions are, using a single SARS-CoV-2 sequence and databases of other coronaviruses. Predictions are tested against available mutagenesis data and the observed variability of SARS-CoV-2 proteins. Interestingly, predictions agree increasingly with observations, as more SARS-CoV-2 sequences become available. Combining predictions with immunological data, we find an overrepresentation of mutations in current variants of concern. The approach may become relevant for potential outbreaks of future viral diseases.
- Publication:
-
Proceedings of the National Academy of Science
- Pub Date:
- January 2022
- DOI:
- arXiv:
- arXiv:2112.10093
- Bibcode:
- 2022PNAS..11913118R
- Keywords:
-
- Quantitative Biology - Genomics;
- Quantitative Biology - Populations and Evolution
- E-Print:
- 21 pages + supplementary information