Rankings and partial rankings are ubiquitous in data analysis, yet there is relatively little work in the classification community that uses the typical properties of rankings. We review the broader literature that we are aware of, and identify a common building block for both prediction of rankings and clustering of rankings, which is also valid for partial rankings. This building block is the Kemeny distance, defined as the minimum number of interchanges of two adjacent elements required to transform one (partial) ranking into another. The Kemeny distance is equivalent to Kendall's τ for complete rankings, but for partial rankings it is equivalent to Emond and Mason's extension of τ. For clustering, we use the flexible class of methods proposed by Ben-Israel and Iyigun (Journal of Classification 25: 5-26, 2008), and define the disparity between a ranking and the center of cluster as the Kemeny distance. For prediction, we build a prediction tree by recursive partitioning, and define the impurity measure of the subgroups formed as the sum of all within-node Kemeny distances. The median ranking characterizes subgroups in both cases. © Springer International Publishing Switzerland 2013.
CITATION STYLE
Heiser, W. J., & D’Ambrosio, A. (2013). Clustering and prediction of rankings within a kemeny distance framework. In Studies in Classification, Data Analysis, and Knowledge Organization (pp. 19–31). Kluwer Academic Publishers. https://doi.org/10.1007/978-3-319-00035-0_2
Mendeley helps you to discover research relevant for your work.