Clustering and prediction of rankings within a kemeny distance framework

Willem J. Heiser; Antonio D'Ambrosio

Conference Proceedings

Clustering and prediction of rankings within a kemeny distance framework

Studies in Classification, Data Analysis, and Knowledge Organization (2013) 19-31

DOI: 10.1007/978-3-319-00035-0_2

31Citations

16Readers

Get full text

Abstract

Rankings and partial rankings are ubiquitous in data analysis, yet there is relatively little work in the classification community that uses the typical properties of rankings. We review the broader literature that we are aware of, and identify a common building block for both prediction of rankings and clustering of rankings, which is also valid for partial rankings. This building block is the Kemeny distance, defined as the minimum number of interchanges of two adjacent elements required to transform one (partial) ranking into another. The Kemeny distance is equivalent to Kendall's τ for complete rankings, but for partial rankings it is equivalent to Emond and Mason's extension of τ. For clustering, we use the flexible class of methods proposed by Ben-Israel and Iyigun (Journal of Classification 25: 5-26, 2008), and define the disparity between a ranking and the center of cluster as the Kemeny distance. For prediction, we build a prediction tree by recursive partitioning, and define the impurity measure of the subgroups formed as the sum of all within-node Kemeny distances. The median ranking characterizes subgroups in both cases. © Springer International Publishing Switzerland 2013.

Cite

CITATION STYLE

APA

Heiser, W. J., & D’Ambrosio, A. (2013). Clustering and prediction of rankings within a kemeny distance framework. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 19–31). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-319-00035-0_2

Clustering and prediction of rankings within a kemeny distance framework

Abstract

Cite

Register to see more suggestions