TopologyNet: Topology based deep convolutional and multi-task neural networks for biomolecular property predictions
Abstract
Although deep learning approaches have had tremendous success in image, video and audio processing, computer vision, and speech recognition, their applications to three-dimensional (3D) biomolecular structural data sets have been hindered by the entangled geometric complexity and biological complexity. We introduce topology, i.e., element specific persistent homology (ESPH), to untangle geometric complexity and biological complexity. ESPH represents 3D complex geometry by one-dimensional (1D) topological invariants and retains crucial biological information via a multichannel image representation. It is able to reveal hidden structure-function relationships in biomolecules. We further integrate ESPH and convolutional neural networks to construct a multichannel topological neural network (TopologyNet) for the predictions of protein-ligand binding affinities and protein stability changes upon mutation. To overcome the limitations to deep learning arising from small and noisy training sets, we present a multitask topological convolutional neural network (MT-TCNN). We demonstrate that the present TopologyNet architectures outperform other state-of-the-art methods in the predictions of protein-ligand binding affinities, globular protein mutation impacts, and membrane protein mutation impacts.
- Publication:
-
PLoS Computational Biology
- Pub Date:
- July 2017
- DOI:
- arXiv:
- arXiv:1704.00063
- Bibcode:
- 2017PLSCB..13E5690C
- Keywords:
-
- Quantitative Biology - Quantitative Methods
- E-Print:
- 20 pages, 8 figures, 5 tables