Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination

26Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.

Abstract

We propose a hierarchical Bayesian nonparametric mixture model for clustering when some of the covariates are assumed to be of varying relevance to the clustering problem. This can be thought of as an issue in variable selection for unsupervised learning. We demonstrate that by defining a hierarchical pop-ulation based nonparametric prior on the cluster locations scaled by the inverse covariance matrices of the likelihood we arrive at a 'sparsity prior' representation which admits a conditionally conjugate prior. This allows us to perform full Gibbs sampling to obtain posterior distributions over parameters of interest including an explicit measure of each covariate's relevance and a distribution over the number of potential clusters present in the data. This also allows for individual cluster specific variable selection. We demonstrate improved inference on a number of canonical problems. © 2011 International Society for Bayesian Analysis.

Cite

CITATION STYLE

APA

Yau, C., & Holmes, C. (2011). Hierarchical Bayesian nonparametric mixture models for clustering with variable relevance determination. Bayesian Analysis, 6(2), 329–352. https://doi.org/10.1214/11-BA612

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free