An overview of web data clustering practices

Athena Vakali; Jaroslav Pokomý; Theodore Dalamagas

Journal Article

An overview of web data clustering practices

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3268 597-606

DOI: 10.1007/978-3-540-30192-9_59

48Citations

52Readers

Get full text

Abstract

Clustering is a challenging topic in the area of Web data management. Various forms of clustering are required in a wide range of applications, including finding mirrored Web pages, detecting copyright violations, and reporting search results in a structured way. Clustering can either be performed once offline, (independently to search queries), or online (on the results of search queries). Important efforts have focused on mining Web access logs and to cluster search engine results on the fly. Online methods based on link structure and text have been applied successfully to finding pages on related topics. This paper presents an overview of the most popular methodologies and implementations in terms of clustering either Web users or Web sources and presents a survey about current status and future trends in clustering employed over the Web. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Vakali, A., Pokomý, J., & Dalamagas, T. (2004). An overview of web data clustering practices. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3268, 597–606. https://doi.org/10.1007/978-3-540-30192-9_59

An overview of web data clustering practices

Abstract

Cite

Register to see more suggestions