On the approximation of the Kolmogorov complexity for DNA sequences

16Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The Kolmogorov complexity furnishes several ways for studying different natural processes that can be expressed using sequences of symbols from a finite alphabet, such as the case of DNA sequences. Although the Kolmogorov complexity is not algorithmically computable, it can be approximated by lossless normal compressors. In this paper, we use a specific DNA compressor to approximate the Kolmogorov complexity and we assess it regarding its normality. Then, we use it on several datasets, that are constituted by different DNA sequences, representing complete genomes of different species and domains. We show several evolution-related insights associated with the complexity, namely that, globally, archaea have higher relative complexity than bacteria and eukaryotes.

Cite

CITATION STYLE

APA

Pratas, D., & Pinho, A. J. (2017). On the approximation of the Kolmogorov complexity for DNA sequences. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10255 LNCS, pp. 259–266). Springer Verlag. https://doi.org/10.1007/978-3-319-58838-4_29

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free