Metadata vs. Ground-truth: A Myth behind the Evolution of Community Detection Methods

8Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.
Get full text

Abstract

A community detection (CD) method is usually evaluated by what extent it is able to discover the 'ground-truth' community structure of a network. A certain 'node-centric metadata' is used to define the ground-truth partition. However, nodes in real networks often have multiple metadata types (e.g., occupation, location); each can potentially form a ground-truth partition. Our experiment with 10 CD methods on 5 datasets (having multiple metadata-based ground-truth partitions) show that the metadata-based evaluation is misleading because there is no single CD method that can outperform others by detecting all types of metadata-based partitions. We further show that the community structure obtained from the CD methods is usually topologically stronger than any metadata-based partitions. Finally, we suggest a new task-based evaluation framework for CD methods and show that a certain type of CD methods is useful for a certain type of task.

Cite

CITATION STYLE

APA

Chakraborty, T., Cui, Z., & Park, N. (2018). Metadata vs. Ground-truth: A Myth behind the Evolution of Community Detection Methods. In The Web Conference 2018 - Companion of the World Wide Web Conference, WWW 2018 (pp. 45–46). Association for Computing Machinery, Inc. https://doi.org/10.1145/3184558.3186921

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free