Generative Antibody Design for Complementary Chain Pairing Sequences through Encoder-Decoder Language Model
Abstract
Current protein language models (pLMs) predominantly focus on single-chain protein sequences and often do not account for the constraints that protein-protein interactions impose on generative design. To address this gap, we present paired Antibody T5 (pAbT5), an encoder-decoder model that generates a complementary heavy or light chain from its pairing partner. We show that our model respects conservation in framework regions and variability in hypervariable domains, as demonstrated by agreement with sequence alignment and by variable-length CDR loops. We also show that our model captures chain pairing preferences through the recovery of ground-truth chain type and gene families. Our results showcase the potential of pAbT5 in generative antibody design that incorporates biological constraints from chain pairing preferences.
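As a rough illustration of the chain-to-chain setup described in the abstract, the sketch below turns one paired antibody into two sequence-to-sequence examples, one per generation direction. It is not the authors' pipeline: the direction tags `<h2l>` and `<l2h>`, the per-residue tokenization, and the helper names `tokenize` and `make_pairs` are all assumptions made for illustration only.

```python
from typing import List, Tuple

# Hypothetical direction tags (an assumption): the abstract does not say
# how, or whether, the model marks the direction of generation.
HEAVY_TO_LIGHT = "<h2l>"
LIGHT_TO_HEAVY = "<l2h>"


def tokenize(seq: str) -> List[str]:
    """Split an amino-acid sequence into single-residue tokens."""
    return list(seq.strip().upper())


def make_pairs(heavy: str, light: str) -> List[Tuple[List[str], List[str]]]:
    """Return (encoder_input, decoder_target) examples for both directions."""
    h, l = tokenize(heavy), tokenize(light)
    return [
        ([HEAVY_TO_LIGHT] + h, l),  # heavy chain in, light chain out
        ([LIGHT_TO_HEAVY] + l, h),  # light chain in, heavy chain out
    ]


# Short illustrative fragments, not full variable domains.
pairs = make_pairs("EVQLVESGG", "DIQMTQSPS")
for src, tgt in pairs:
    print(" ".join(src), "->", " ".join(tgt))
```

Emitting both directions from every pair is one simple way to let a single encoder-decoder model serve heavy-to-light and light-to-heavy generation; whether pAbT5 is trained this way is not stated in the abstract.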
- Publication: arXiv e-prints
- Pub Date: January 2023
- DOI: 10.48550/arXiv.2301.02748
- arXiv: arXiv:2301.02748
- Bibcode: 2023arXiv230102748C
- Keywords: Quantitative Biology - Biomolecules; Computer Science - Computational Engineering, Finance, and Science; Computer Science - Computation and Language