The 3D structure of a protein is closely related to its function, and the similarity analysis between their structures can help reveal the function of proteins. However, there exist two problems arising from the analysis of 3D structures of proteins. The proteins with a similar sequence may have different structures, while the proteins with a similar structure may have different sequences. In the analysis of similarity in 3D structures of proteins, it remains difficult for the traditional methods using the spatial feature distribution and geometry or topology features of proteins to solve these problems. In this paper, a Tile-CNN network is proposed to analyze the similarity of proteins in 3D structure. In order to capture the overall and the local features as exhibited by the 3D structures of proteins, it projects 3D protein models into 2D protein images from different views and then cuts these 2D projected images using the tile strategy. After the training of proteins with these images in the Tile-CNN, the test protein model can be expressed by an analysis matrix, and then the similarity between 3D structures of proteins is computed using the root mean square distance (RMSD) for the benchmark matrix and the analysis matrix. As revealed by the experimental results, the proposed algorithm is more robust in analyzing the similarity of 3D structures of proteins and produces a satisfactory performance in solving the two aforementioned problems.
CITATION STYLE
Qin, S., Li, Z., He, L., & Lin, W. (2020). Similarity Analysis of 3D Structures of Proteins Based Tile-CNN. IEEE Access, 8, 44622–44631. https://doi.org/10.1109/ACCESS.2020.2977945
Mendeley helps you to discover research relevant for your work.