In large discrete data sets that require classification into signal and noise components, the distribution of the signal is often very bumpy and does not follow a standard distribution. The signal distribution is therefore commonly modelled as a mixture of component distributions. Modelling the signal as a mixture, however, raises two challenges: justifying the number of components, and the label switching problem (caused by multimodality of the likelihood function). To circumvent these challenges, we propose a non-parametric structure for the signal component. The new method is more efficient, yielding more precise estimates and better classifications. We demonstrate the efficacy of the methodology using a ChIP-sequencing data set.
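The label switching problem mentioned above can be illustrated with a minimal sketch. It assumes a two-component Poisson mixture purely for illustration (the abstract does not specify the component family), and the counts are made-up toy values, not ChIP-seq data. The function names are hypothetical. Swapping the component labels leaves the likelihood unchanged, which is why the likelihood surface has multiple equivalent modes.

```python
import math

def poisson_pmf(k, lam):
    # Poisson probability mass, computed on the log scale for stability.
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))

def mixture_loglik(counts, weight, lam_a, lam_b):
    # Log-likelihood of a two-component Poisson mixture (illustrative assumption).
    return sum(
        math.log(weight * poisson_pmf(k, lam_a) + (1 - weight) * poisson_pmf(k, lam_b))
        for k in counts
    )

# Toy counts invented for this example only.
counts = [0, 1, 0, 2, 1, 0, 7, 9, 1, 0, 8, 0]

# Same model, with the two component labels exchanged.
original = mixture_loglik(counts, 0.7, 0.8, 8.0)
swapped = mixture_loglik(counts, 0.3, 8.0, 0.8)

print(math.isclose(original, swapped))
```

Because both parameterisations describe the same distribution, the data alone cannot tell which component is "first"; a signal component that is not itself split into labelled sub-components avoids this ambiguity within the signal.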
Alhaji, B. B., Dai, H., Hayashi, Y., Vinciotti, V., Harrison, A., & Lausen, B. (2016). Analysis of ChIP-seq data via Bayesian finite mixture models with a non-parametric component. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 507–517). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-319-25226-1_43