Clustering Nominal data with Equivalent Categories

Marian Hickendorff; Willem J. Heiser; Cornells M. van Putten; Norman D. Verhelst

Journal ArticleOPEN ACCESS

Clustering Nominal data with Equivalent Categories

Hickendorff M
Heiser W
van Putten C
et al.

Behaviormetrika (2008) 35(1) 35-54

DOI: 10.2333/bhmk.35.35

N/ACitations

10Readers

Get full text

Abstract

The problem considered in the present paper is how to cluster data of nominal measurement level, where the categories of the variables are equivalent (the variables are replications of each other). One suitable technique to obtain such a clustering is latent class analysis (LCA) with equality restrictions on the conditional probabilities. As an alternative, a less well known technique is introduced: GROUPALS. This is an algorithm for the simultaneous scaling (by multiple correspondence analysis) and clustering of categorical variables. Equality restrictions on the category quantifications were incorporated in the algorithm, to account for equivalent categories. In two simulation studies, the clustering performance was assessed by measuring the recovery of true cluster membership of the individuals. The effect of several systematically varied data features was studied. Restricted LCA obtained good to excellent cluster recovery results. Restricted GROUPALS approximated this optimal performance reasonably well, except when underlying classes were very different in size.

Cite

CITATION STYLE

APA

Hickendorff, M., Heiser, W. J., van Putten, C. M., & Verhelst, N. D. (2008). Clustering Nominal data with Equivalent Categories. Behaviormetrika, 35(1), 35–54. https://doi.org/10.2333/bhmk.35.35

Clustering Nominal data with Equivalent Categories

Abstract

Cite

Register to see more suggestions