Non-repetitive DNA sequence compression using memoization

K. G. Srinivasa; M. Jagadish; K. R. Venugopal; L. M. Patnaik

Conference Proceedings

Non-repetitive DNA sequence compression using memoization

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4345 LNBI 402-412

DOI: 10.1007/11946465_36

2Citations

2Readers

Get full text

Abstract

With increasing number of DNA sequences being discovered the problem of storing and using genomic databases has become vital. Since DNA sequences consist of only four letters, two bits are sufficient to store each base. Many algorithms have been proposed in the recent past that push the bits/base limit further. The subtle patterns in DNA along with statistical inferences have been exploited to increase the compression ratio. From the compression perspective, the entire DNA sequences can be considered to be made of two types of sequences: repetitive and non-repetitive. The repetitive parts are compressed used dictionary-based schemes and non-repetitive sequences of DNA are usually compressed using general text compression schemes. In this paper, we present a niemoization based encoding scheme for non-repeat DNA sequences. This scheme is incorporated with a DNA-specific compression algorithm, DNAPack, which is used for compression of DNA sequences. The results show that our method noticeably performs better than other techniques of its kind. © Springer-Verlag Berlin Heidelberg 2006.

Author supplied keywords

Cite

CITATION STYLE

APA

Srinivasa, K. G., Jagadish, M., Venugopal, K. R., & Patnaik, L. M. (2006). Non-repetitive DNA sequence compression using memoization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4345 LNBI, pp. 402–412). Springer Verlag. https://doi.org/10.1007/11946465_36

Non-repetitive DNA sequence compression using memoization

Abstract

Author supplied keywords

Cite

Register to see more suggestions