Large-Scale Community Detection on YouTube for Topic Discovery and Exploration

66Citations
Citations of this article
83Readers
Mendeley users who have this article in their library.

Abstract

Detecting coherent, well-connected communities in large graphs provides insight into the graph structure and can serve as the basis for content discovery. Clustering is a popular technique for community detection but global algorithms that examine the entire graph do not scale. Local algorithms are highly parallelizable but perform sub-optimally, especially in applications where we need to optimize multiple metrics. We present a multi-stage algorithm based on local-clustering that is highly scalable, combining a pre-processing stage, a local clustering stage, and a post-processing stage. We apply this to the YouTube video graph to generate named clusters of videos with coherent content. We formalize coverage, coherence, and connectivity metrics and evaluate the quality of the algorithm for large YouTube graphs. Our use of local algorithms for global clustering, and its implementation and practical evaluation on such a large scale is a first of its kind.

Cite

CITATION STYLE

APA

Gargi, U., Lu, W., Mirrokni, V., & Yoon, S. (2011). Large-Scale Community Detection on YouTube for Topic Discovery and Exploration. In Proceedings of the 5th International AAAI Conference on Weblogs and Social Media, ICWSM 2011 (pp. 486–489). AAAI Press. https://doi.org/10.1609/icwsm.v5i1.14191

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free