This paper introduces a novel algorithm for DNA sequence compression that makes use of a transformation and statistical properties within the transformed sequence. The designed compression algorithm is efficient and effective for DNA sequence compression. As a statistical compression method, it is able to search the pattern inside the compressed text which is useful in knowledge discovery. Experiments show that our algorithm is shown to outperform existing compressors on typical DNA sequence datasets. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Gupta, A., Rishiwal, V., & Agarwal, S. (2010). Efficient storage of massive biological sequences in compact form. In Communications in Computer and Information Science (Vol. 95 CCIS, pp. 13–22). https://doi.org/10.1007/978-3-642-14825-5_2
Mendeley helps you to discover research relevant for your work.