Existing cancer benchmark data sets for human sequencing data use germline variants, synthetic methods, or expensive validations, none of which are satisfactory for providing a large collection of true somatic variation across a whole genome. Here we propose a data set, Lineage derived Somatic Truth (LinST), of short somatic mutations in the HT115 colon cancer cell-line, that are validated using a known cell lineage that includes thousands of mutations and a high confidence region covering 2.7 gigabases per sample.
CITATION STYLE
Shand, M., Soto, J., Lichtenstein, L., Benjamin, D., Farjoun, Y., Brody, Y., … Banks, E. (2020). A validated lineage-derived somatic truth data set enables benchmarking in cancer genome analysis. Communications Biology, 3(1). https://doi.org/10.1038/s42003-020-01460-9
Mendeley helps you to discover research relevant for your work.