A Bayesian mixture model for chromatin interaction data

6Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

Chromatin interactions mediated by a particular protein are of interest for studying gene regulation, especially the regulation of genes that are associated with, or known to be causative of, a disease. A recent molecular technique, Chromatin interaction analysis by paired-end tag sequencing (ChIA-PET), that uses chromatin immunoprecipitation (ChIP) and high throughput paired-end sequencing, is able to detect such chromatin interactions genomewide. However, ChIA-PET may generate noise (i.e., pairings of DNA fragments by random chance) in addition to true signal (i.e., pairings of DNA fragments by interactions). In this paper, we propose MC-DIST based on a mixture modeling framework to identify true chromatin interactions from ChIA-PET count data (counts of DNA fragment pairs). The model is cast into a Bayesian framework to take into account the dependency among the data and the available information on protein binding sites and gene promoters to reduce false positives. A simulation study showed that MC-DIST outperforms the previously proposed hypergeometric model in terms of both power and type I error rate. A real data study showed that MC-DIST may identify potential chromatin interactions between protein binding sites and gene promoters that may be missed by the hypergeometric model. An R package implementing the MC-DIST model is available at http://www.stat.osu.edu/~statgen/SOFTWARE/MDM.

Cite

CITATION STYLE

APA

Niu, L., & Lin, S. (2015). A Bayesian mixture model for chromatin interaction data. Statistical Applications in Genetics and Molecular Biology, 14(1), 53–64. https://doi.org/10.1515/sagmb-2014-0029

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free