Tumor cells are genetically heterogeneous. The collection of the entire tumor cell population consists of different subclones that can be characterized by mutations in sequence and structure at various genomic locations. Using nextgeneration sequencing data, we characterize tumor heterogeneity using Bayesian nonparametric inference. Specifically, we estimate the number of subclones in a tumor sample, and for each subclone, we estimate the subclonal copy number and single nucleotide mutations at a selected set of loci. Posterior summaries are presented in three matrices, namely, the matrix of subclonal copy numbers (L), subclonal variant alleles (Z), and the population frequencies of the subclones (w). The proposed method can handle a single or multiple tumor samples. Computation via Markov chain Monte Carlo yields posterior Monte Carlo samples of all three matrices, allowing for the assessment of any desired inference summary. Simulation and real-world examples are provided as illustration.
CITATION STYLE
Ji, Y., Sengupta, S., Lee, J., Müller, P., & Gulukota, K. (2015). Estimating latent cell subpopulations with Bayesian feature allocation models. In Nonparametric Bayesian Inference in Biostatistics (pp. 77–96). Springer International Publishing. https://doi.org/10.1007/978-3-319-19518-6_4
Mendeley helps you to discover research relevant for your work.