Simplifying amino acid alphabets using a genetic algorithm and sequence alignment

4Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In some areas of bioinformatics (like protein folding or sequence alignment) the full alphabet of amino acid symbols is not necessary. Often, better results are received with simplified alphabets. In general, simplified alphabets are as universal as possible. In this paper we show that this concept may not be optimal. We present a genetic algorithm for alphabet simplifying and we use it in a method based on global sequence alignment. We demonstrate that our algorithm is much faster and produces better results than the previously presented genetic algorithm. We also compare alphabets constructed on the base of universal substitution matrices like BLOSUM with our alphabets built through sequence alignment and propose a new coefficient describing the value of alphabets in the sequence alignment context. Finally we show that our simplified alphabets give better results in a sequence classification (using k-NN classifier), than most previously presented simplified alphabets and better than full 20-letter alphabet. © Springer-Verlag Berlin Heidelberg 2007.

Cite

CITATION STYLE

APA

Lenckowski, J., & Walczak, K. (2007). Simplifying amino acid alphabets using a genetic algorithm and sequence alignment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4447 LNCS, pp. 122–131). Springer Verlag. https://doi.org/10.1007/978-3-540-71783-6_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free