Number sequence representation of protein structures based on the second derivative of a folded tetrahedron sequence
Abstract
This paper proposes a new mathematical approach to characterize native protein structures based on the discrete differential geometry of tetrahedron tiles. In the approach, local structure of proteins is classified into finite types according to shape. And one would obtain a number sequence representation of protein structures automatically. As a result, it would become possible to quantify structural preference of amino-acids objectively. And one could use the wide variety of sequence alignment programs to study protein structures since the number sequence has no internal structure. The programs and this paper with clear figures are available from http://www.genocript.com.
- Publication:
-
arXiv e-prints
- Pub Date:
- October 2006
- DOI:
- 10.48550/arXiv.q-bio/0610017
- arXiv:
- arXiv:q-bio/0610017
- Bibcode:
- 2006q.bio....10017M
- Keywords:
-
- Quantitative Biology - Biomolecules;
- Computer Science - Computational Geometry;
- Computer Science - Discrete Mathematics;
- Mathematics - Metric Geometry
- E-Print:
- 11 pages (2400 words + 3 figures + 4tables)