An information-based measure for grouping quality

Erik A. Engbers; Michael Lindenbaum; Arnold W.M. Smeulders

Journal ArticleOPEN ACCESS

An information-based measure for grouping quality

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3023 392-404

DOI: 10.1007/978-3-540-24672-5_31

1Citations

12Readers

Abstract

We propose a method for measuring the quality of a grouping result, based on the following observation: a better grouping result provides more information about the true, unknown grouping. The amount of information is evaluated using an automatic procedure, relying on the given hypothesized grouping, which generates (homogeneity) queries about the true grouping and answers them using an oracle. The process terminates once the queries suffice to specify the true grouping. The number of queries is a measure of the hypothesis non-informativeness. A relation between the query count and the (probabilistically characterized) uncertainty of the true grouping, is established and experimentally supported. The proposed information-based quality measure is free from arbitrary choices, uniformly treats different types of grouping errors, and does not favor any algorithm. We also found that it approximates human judgment better than other methods and gives better results when used to optimize a segmentation algorithm. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Engbers, E. A., Lindenbaum, M., & Smeulders, A. W. M. (2004). An information-based measure for grouping quality. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3023, 392–404. https://doi.org/10.1007/978-3-540-24672-5_31

An information-based measure for grouping quality

Abstract

Cite

Register to see more suggestions