Protein structure alignment has become an important strategy by which to identify evolutionary relationships between protein sequences. Several alignment tools are currently available for online comparison of protein structures. In this paper, we propose a parallel protein structure alignment service based on the Hadoop distribution framework. This service includes a protein structure alignment algorithm, a refinement algorithm, and a MapReduce programming model. The refinement algorithm refines the result of alignment. To process vast numbers of protein structures in parallel, the alignment and refinement algorithms are implemented using MapReduce. We analyzed and compared the structure alignments produced by different methods using a dataset randomly selected from the PDB database. The experimental results verify that the proposed algorithm refines the resulting alignments more accurately than existing algorithms. Meanwhile, the computational performance of the proposed service is proportional to the number of processors used in our cloud platform. © 2013 Che-Lun Hung and Yaw-Ling Lin.
CITATION STYLE
Hung, C. L., & Lin, Y. L. (2013). Implementation of a parallel protein structure alignment service on cloud. International Journal of Genomics, 2013. https://doi.org/10.1155/2013/439681
Mendeley helps you to discover research relevant for your work.