Abstract
Motivation: Secondary structures are key descriptors of a protein fold and its topology. In recent years, they facilitated intensive computational tasks for finding structural homologues, fold prediction and protein design. Their popularity stems from an appealing regularity in patterns of geometry and chemistry. However, the definition of secondary structures is of subjective nature. An unsupervised de-novo discovery of these structures would shed light on their nature, and improve the way we use these structures in algorithms of structural bioinformatics. Methods: We developed a new method for unsupervised partitioning of undirected graphs, based on patterns of small recurring network motifs. Our input was the network of all H-bonds and covalent interactions of protein backbones. This method can be also used for other biological and non-biological networks. Results: In a fully unsupervised manner, and without assuming any explicit prior knowledge, we were able to rediscover the existence of conventional α-helices, parallel β-sheets, anti-parallel sheets and loops, as well as various non-conventional hybrid structures. The relation between connectivity and crystallographic temperature factors establishes the existence of novel secondary structures. © 2007 Oxford University Press.
Cite
CITATION STYLE
Raveh, B., Basri, R., & Schreiber, G. (2007). Rediscovering secondary structures as network motifs - An unsupervised learning approach. In Bioinformatics (Vol. 23). Oxford University Press. https://doi.org/10.1093/bioinformatics/btl290
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.