An overview of web data clustering practices

48Citations
Citations of this article
52Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Clustering is a challenging topic in the area of Web data management. Various forms of clustering are required in a wide range of applications, including finding mirrored Web pages, detecting copyright violations, and reporting search results in a structured way. Clustering can either be performed once offline, (independently to search queries), or online (on the results of search queries). Important efforts have focused on mining Web access logs and to cluster search engine results on the fly. Online methods based on link structure and text have been applied successfully to finding pages on related topics. This paper presents an overview of the most popular methodologies and implementations in terms of clustering either Web users or Web sources and presents a survey about current status and future trends in clustering employed over the Web. © Springer-Verlag 2004.

Cite

CITATION STYLE

APA

Vakali, A., Pokomý, J., & Dalamagas, T. (2004). An overview of web data clustering practices. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 3268, 597–606. https://doi.org/10.1007/978-3-540-30192-9_59

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free