Modular and configurable optimal sequence alignment software: Cola

2Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: The fundamental challenge in optimally aligning homologous sequences is to define a scoring scheme that best reflects the underlying biological processes. Maximising the overall number of matches in the alignment does not always reflect the patterns by which nucleotides mutate. Efficiently implemented algorithms that can be parameterised to accommodate more complex non-linear scoring schemes are thus desirable.Results: We present Cola, alignment software that implements different optimal alignment algorithms, also allowing for scoring contiguous matches of nucleotides in a nonlinear manner. The latter places more emphasis on short, highly conserved motifs, and less on the surrounding nucleotides, which can be more diverged. To illustrate the differences, we report results from aligning 14,100 sequences from 3' untranslated regions of human genes to 25 of their mammalian counterparts, where we found that a nonlinear scoring scheme is more consistent than a linear scheme in detecting short, conserved motifs.Conclusions: Cola is freely available under LPGL from https://github.com/nedaz/cola. © 2014 Zamani et al.; licensee BioMed Central Ltd.

Cite

CITATION STYLE

APA

Zamani, N., Sundström, G., Höppner, M. P., & Grabherr, M. G. (2014). Modular and configurable optimal sequence alignment software: Cola. Source Code for Biology and Medicine, 9(1). https://doi.org/10.1186/1751-0473-9-12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free