Efficient utility-based clustering over high dimensional partition spaces

5Citations
Citations of this article
9Readers
Mendeley users who have this article in their library.

Abstract

Because of the huge number of partitions of even a moderately sized dataset, even when Bayes factors have a closed form, in model-based clustering a comprehensive search for the highest scoring (MAP) partition is usually impossible. However, when each cluster in a partition has a signature and it is known that some signatures are of scientific interest whilst others are not, it is possible, within a Bayesian framework, to develop search algorithms which are guided by these cluster signatures. Such algorithms can be expected to find better partitions more quickly. In this paper we develop a framework within which these ideas can be formalized. We then briefly illustrate the efficacy of the proposed guided search on a microarray time course data set where the clustering objective is to identify clusters of genes with different types of circadian expression profiles. © 2009 International Society for Bayesian Analysis.

Cite

CITATION STYLE

APA

Liverani, S., Anderson, P. E., Edwards, K. D., Millar, A. J., & Smith, J. Q. (2009). Efficient utility-based clustering over high dimensional partition spaces. Bayesian Analysis, 4(3), 539–572. https://doi.org/10.1214/09-BA420

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free