We define a new pairwise sequence comparison scheme for distantly related proteins and report its performance on the remote homology detection task. The new scheme compares two protein sequences by using the maximal unique matches (MUMs) between them. Once identified, the length of all non-overlapping MUMs is used to define the similarity between two sequences. To detect the homology of a protein to a protein family, we utilize feature vectors containing all pairwise similarity scores between the test protein and the proteins in the training set. Support vector machines are employed for the binary classification, as in recent work. The new method is shown to be more accurate than recent methods including SVM-Fisher and SVM-BLAST, and competitive with SVM-Pairwise. In terms of computational efficiency, the new method performs much better than SVM-Pairwise. © Springer-Verlag Berlin Heidelberg 2005.
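The scoring idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`find_mums`, `mum_similarity`, `feature_vector`), the `min_len` parameter, the greedy longest-first rule for resolving overlapping MUMs, and the absence of any score normalization are all assumptions made for illustration. The search is a naive quadratic scan, whereas an efficient implementation would more likely rely on a suffix tree or suffix array.

```python
def count_occurrences(text, pattern):
    """Count occurrences of pattern in text, including overlapping ones."""
    count = 0
    start = text.find(pattern)
    while start != -1:
        count += 1
        start = text.find(pattern, start + 1)
    return count


def find_mums(a, b, min_len=1):
    """Return (start_in_a, start_in_b, length) for each maximal unique match.

    A match qualifies if it cannot be extended to the left or right and
    its substring occurs exactly once in each sequence.
    min_len is a hypothetical threshold, not taken from the paper.
    """
    mums = []
    for i in range(len(a)):
        for j in range(len(b)):
            if a[i] != b[j]:
                continue
            # Skip starts that could be extended to the left.
            if i > 0 and j > 0 and a[i - 1] == b[j - 1]:
                continue
            # Extend to the right as far as the residues agree.
            k = 0
            while i + k < len(a) and j + k < len(b) and a[i + k] == b[j + k]:
                k += 1
            if k < min_len:
                continue
            s = a[i:i + k]
            if count_occurrences(a, s) == 1 and count_occurrences(b, s) == 1:
                mums.append((i, j, k))
    return mums


def _overlaps(s1, l1, s2, l2):
    """True if half-open intervals [s1, s1+l1) and [s2, s2+l2) intersect."""
    return s1 < s2 + l2 and s2 < s1 + l1


def mum_similarity(a, b, min_len=1):
    """Total length of a non-overlapping subset of MUMs.

    Assumption: overlaps are resolved greedily, longest match first.
    """
    chosen = []
    for i, j, k in sorted(find_mums(a, b, min_len), key=lambda m: (-m[2], m[0], m[1])):
        clash = any(_overlaps(i, k, ci, ck) or _overlaps(j, k, cj, ck)
                    for ci, cj, ck in chosen)
        if not clash:
            chosen.append((i, j, k))
    return sum(k for _, _, k in chosen)


def feature_vector(query, training_seqs, min_len=1):
    """Similarity of the query to every training sequence, in order."""
    return [mum_similarity(query, t, min_len) for t in training_seqs]
```

A vector produced by `feature_vector` would then serve as the input to a binary SVM classifier trained for one protein family, as the abstract describes; the classifier itself is omitted here.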
CITATION STYLE
Oğul, H., & Mumcuoğlu, Ü. E. (2005). Discriminative remote homology detection using maximal unique sequence matches. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3646 LNCS, pp. 283–292). https://doi.org/10.1007/11552253_26