We consider the design and evaluation of short barcodes, six to eight nucleotides long, for parallel sequencing on platforms where substitution errors dominate. Such codes should not only have good error-correction properties; their code words should also fulfil certain biological constraints (experimental parameters). We compare published barcodes with codes obtained by two new construction methods: one based on the currently best known linear codes, and a simple randomized construction. The evaluation is done with respect to error-correction capability, barcode size, and experimental parameters, as well as fundamental bounds on code size and distance properties. We provide a list of codes for lengths between six and eight nucleotides; for length eight, two substitution errors can be corrected. In fact, no code with larger minimum distance can exist. © 2013 Mir et al.
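To make the idea of a randomized construction concrete, the following is a minimal Python sketch and not the authors' method. It assumes a greedy scheme: shuffle all words over {A, C, G, T}, then keep a word only if it passes two illustrative filters and lies at Hamming distance at least `min_dist` from every word already kept. The filters (a GC-content window and a maximum homopolymer run) stand in for the "biological constraints" mentioned above; their thresholds, like all function names here, are assumptions. A code with minimum Hamming distance d corrects up to ⌊(d − 1)/2⌋ substitutions by nearest-codeword decoding.

```python
import random
from itertools import combinations, product

ALPHABET = "ACGT"


def hamming(a, b):
    """Number of positions at which two equal-length words differ."""
    return sum(x != y for x, y in zip(a, b))


def gc_content(word):
    """Fraction of G and C nucleotides in the word."""
    return sum(c in "GC" for c in word) / len(word)


def max_homopolymer(word):
    """Length of the longest run of identical nucleotides."""
    longest = run = 1
    for prev, cur in zip(word, word[1:]):
        run = run + 1 if prev == cur else 1
        longest = max(longest, run)
    return longest


def random_barcodes(length, min_dist, seed=0, gc_range=(0.4, 0.6), max_run=2):
    """Greedy randomized construction (illustrative assumption, not the paper's).

    Visits all 4**length words in random order and keeps each word that
    passes the filters and is far enough from every word kept so far.
    """
    rng = random.Random(seed)
    candidates = ["".join(p) for p in product(ALPHABET, repeat=length)]
    rng.shuffle(candidates)
    code = []
    for word in candidates:
        if not gc_range[0] <= gc_content(word) <= gc_range[1]:
            continue  # assumed GC-content window
        if max_homopolymer(word) > max_run:
            continue  # assumed homopolymer limit
        if all(hamming(word, kept) >= min_dist for kept in code):
            code.append(word)
    return code


def min_distance(code):
    """Smallest pairwise Hamming distance in the code."""
    return min(hamming(a, b) for a, b in combinations(code, 2))


def decode(read, code):
    """Nearest-codeword decoding under Hamming distance."""
    return min(code, key=lambda word: hamming(read, word))


if __name__ == "__main__":
    code = random_barcodes(length=6, min_dist=3)
    d = min_distance(code)
    print(len(code), "barcodes, minimum distance", d,
          "corrects", (d - 1) // 2, "substitution(s)")
```

Because the construction is greedy and seeded, different seeds give codes of different sizes; comparing those sizes against known bounds is the kind of evaluation the abstract describes.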
Mir, K., Neuhaus, K., Bossert, M., & Schober, S. (2013). Short barcodes for next generation sequencing. PLoS ONE, 8(12). https://doi.org/10.1371/journal.pone.0082933