Loop-Diffusion: an equivariant diffusion model for designing and scoring protein loops
Abstract
Predicting protein functional characteristics from structure remains a central problem in protein science, with broad implications ranging from understanding the mechanisms of disease to designing novel therapeutics. Current machine learning methods are limited by scarce and biased experimental data, while physics-based methods are either too slow to be useful or too simplified to be accurate. In this work, we present Loop-Diffusion, an energy-based diffusion model that leverages a dataset of general protein loops from the entire protein universe to learn an energy function that generalizes to functional prediction tasks. We evaluate Loop-Diffusion's performance on scoring TCR-pMHC interfaces and demonstrate state-of-the-art results in recognizing binding-enhancing mutations.
- Publication: arXiv e-prints
- Pub Date: September 2024
- DOI: 10.48550/arXiv.2409.18201
- arXiv: arXiv:2409.18201
- Bibcode: 2024arXiv240918201B
- Keywords: Physics - Biological Physics; Computer Science - Machine Learning; Quantitative Biology - Quantitative Methods