Tissue samples from the same tumour are heterogeneous. They consist of different subclones that can be characterized by differences in DNA nucleotide sequences and copy numbers on multiple loci. Inference on tumour heterogeneity thus involves the identification of the subclonal copy number and single-nucleotide mutations at a selected set of loci. We carry out such inference on the basis of a Bayesian feature allocation model. We jointly model subclonal copy numbers and the corresponding allele sequences for the same loci, using three random matrices, L, Z and w, to represent subclonal copy numbers (L), the number of subclonal variant alleles (Z) and the cellular fractions (w) of subclones in one or more tumour samples respectively. The unknown number of subclones implies a random number of columns. More than one subclone indicates tumour heterogeneity. Using simulation studies and a real data analysis with next generation sequencing data, we demonstrate how posterior inference on the subclonal structure is enhanced with the joint modelling of both structure and sequencing variants on subclonal genomes. An R package is available from http://cran.r-project.org/web/packages/BayClone2/index.html.
CITATION STYLE
Lee, J., Müller, P., Sengupta, S., Gulukota, K., & Ji, Y. (2016). Bayesian inference for intratumour heterogeneity in mutations and copy number variation. Journal of the Royal Statistical Society. Series C: Applied Statistics, 65(4), 547–563. https://doi.org/10.1111/rssc.12136
Mendeley helps you to discover research relevant for your work.