Searching sequence space for protein catalysts
Abstract
Genetic selection was used to explore the probability of finding enzymes in protein sequence space. Large degenerate libraries were prepared by replacing all secondary structure units in a dimeric, helical bundle chorismate mutase with simple binary-patterned modules based on a limited set of four polar and four nonpolar residues. Two-stage in vivo selection yielded catalytically active variants possessing biophysical and kinetic properties typical of the natural enzyme even though ≈80% of the protein originates from the simplified modules and >90% of the protein consists of only eight different amino acids. This study provides a quantitative assessment of the number of sequences compatible with a given fold and implicates previously unidentified residues needed to form a functional active site. Given the extremely low incidence of enzymes in completely unbiased libraries, strategies that combine chemical information with genetic selection, like the one used here, may be generally useful in designing novel protein scaffolds with tailored activities.
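As a rough illustration of how binary patterning shrinks the search space, the sketch below counts the sequences encoded by one patterned module and draws a random member. The residue sets (`POLAR`, `NONPOLAR`) and the 14-residue `HELIX_PATTERN` are assumptions chosen for illustration; the abstract states only that four polar and four nonpolar residues were used and does not list them or give the module patterns.

```python
import random

# Assumed residue alphabets (one-letter codes). The abstract does not
# name the four polar and four nonpolar residues; these are placeholders.
POLAR = "DEKN"
NONPOLAR = "FILM"

# Hypothetical binary pattern for one helical module:
# "P" marks a polar position, "N" a nonpolar position.
HELIX_PATTERN = "PNPPNNPPNPPNNP"


def alphabet_for(position):
    """Return the residue alphabet allowed at a pattern position."""
    if position == "P":
        return POLAR
    if position == "N":
        return NONPOLAR
    raise ValueError(f"unknown pattern symbol: {position!r}")


def library_size(pattern):
    """Count the distinct sequences a binary pattern can encode."""
    size = 1
    for position in pattern:
        size *= len(alphabet_for(position))
    return size


def sample_variant(pattern, rng):
    """Draw one random sequence that obeys the binary pattern."""
    return "".join(rng.choice(alphabet_for(p)) for p in pattern)


if __name__ == "__main__":
    rng = random.Random(0)
    patterned = library_size(HELIX_PATTERN)
    unbiased = 20 ** len(HELIX_PATTERN)  # all 20 amino acids at every site
    print("patterned library size:", patterned)
    print("unbiased library size: ", unbiased)
    print("example variant:       ", sample_variant(HELIX_PATTERN, rng))
```

With four choices per position, a module of length n encodes 4^n sequences rather than the 20^n of an unbiased library, which is the sense in which patterning concentrates the search on sequences more likely to fold.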
- Publication: Proceedings of the National Academy of Sciences
- Pub Date: September 2001
- DOI: 10.1073/pnas.191159298
- Bibcode: 2001PNAS...9810596T
- Keywords: Biochemistry