Mining community structure of named entities from web pages and blogs

Xin Li; Bing Liu; Philip S. Yu

Conference Proceedings

Mining community structure of named entities from web pages and blogs

AAAI Spring Symposium - Technical Report (2006) SS-06-03 108-114

15Citations

23Readers

Abstract

Although community discovery based on social network has been studied extensively in the Web hyperlink environment, limited research has been done in the case of Web documents. The co-occurrence of Words and entities in sentences and documents usually implies some connections among them. Studying such connections may reveal important relationships. In this paper, we investigate the co-occurrences of named entities in Web pages and blogs, and mine communities among those entities. We show that identifying communities in such an environment can be transformed into a graph clustering problem. A hierarchical clustering algorithm is then proposed, which exploits triangle structures within the graph and the mutual information between vertices. Our empirical study shows that the proposed algorithm is promising in discovering communities from Web documents. Compilation copyright © 2006, American Association for Artificial Intelligence (www.aaai.org). All rights reserved.

Cite

CITATION STYLE

APA

Li, X., Liu, B., & Yu, P. S. (2006). Mining community structure of named entities from web pages and blogs. In AAAI Spring Symposium - Technical Report (Vol. SS-06-03, pp. 108–114).

Mining community structure of named entities from web pages and blogs

Abstract

Cite

Register to see more suggestions