The obtention of a set of homogeneous classes of pages according to the browsing patterns identified in web server log files can be very useful for the analysis of organization of the site and of its adequacy to user needs. Such a set of homogeneous classes is often obtained from a dissimilarity measure between the visited pages defined via the visits extracted from the logs. There are however many possibilities for defined such a measure. This paper presents an analysis of different dissimilarity measures based on the comparison between the semantic structure of the site identified by experts and the clustering constructed with standard algo- rithms applied to the dissimilarity matrices generated by the chosen measures.
CITATION STYLE
Rossi, F., De Carvalho, F., Lechevallier, Y., & Da Silva, A. (2006). Dissimilarities for Web Usage Mining (pp. 39–46). https://doi.org/10.1007/3-540-34416-0_5
Mendeley helps you to discover research relevant for your work.