Abstract
Motivation: Multiple sequence alignment is an essential part of bioinformatics tools for a genome-scale study of genes and their evolution relations. However, making an accurate alignment between remote homologs is challenging. Here, we develop a method, called SPEM, that aligns multiple sequences using pre-processed sequence profiles and predicted secondary structures for pairwise alignment, consistency-based scoring for refinement of the pairwise alignment and a progressive algorithm for final multiple alignment. Results: The alignment accuracy of SPEM is compared with those of established methods such as ClustalW, T-Coffee, MUSCLE, ProbCons and PRALINEPSI in easy (homologs) and hard (remote homologs) benchmarks. Results indicate that the average sum of pairwise alignment scores given by SPEM are 7-15% higher than those of the methods compared in aligning remote homologs (sequence identity <30%). Its accuracy for aligning homologs (sequence identity >30%) is statistically indistinguishable from those of the state-of-the-art techniques such as ProbCons or MUSCLE 6.0. © Author 2005. Published by Oxford University Press. All rights reserved.
Cite
CITATION STYLE
Zhou, H., & Zhou, Y. (2005). SPEM: Improving multiple sequence alignment with sequence profiles and predicted secondary structures. Bioinformatics, 21(18), 3615–3621. https://doi.org/10.1093/bioinformatics/bti582
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.