Transformations for the compression of fastq quality scores of next-generation sequencing data

48Citations
Citations of this article
107Readers
Mendeley users who have this article in their library.

Abstract

Motivation: The growth of next-generation sequencing means that more effective and efficient archiving methods are needed to store the generated data for public dissemination and in anticipation of more mature analytical methods later. This article examines methods for compressing the quality score component of the data to partly address this problem.Results: We compare several compression policies for quality scores, in terms of both compression effectiveness and overall efficiency. The policies employ lossy and lossless transformations with one of several coding schemes. Experiments show that both lossy and lossless transformations are useful, and that simple coding methods, which consume less computing resources, are highly competitive, especially when random access to reads is needed. © The Author 2011. Published by Oxford University Press. All rights reserved.

Cite

CITATION STYLE

APA

Wan, R., Anh, V. N., & Asai, K. (2012). Transformations for the compression of fastq quality scores of next-generation sequencing data. Bioinformatics, 28(5), 628–635. https://doi.org/10.1093/bioinformatics/btr689

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free