WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles

  • Ypma A
  • Heskes T
  • Zaïane O
  • et al.
N/ACitations
Citations of this article
6Readers
Mendeley users who have this article in their library.

Abstract

We propose mixtures of hidden Markov models for modelling clickstreams of web surfers. Hence, the page categorization is learned from the data without the need for a (possibly cumbersome) manual categorization. We provide an EM algorithm for training a mixture of HMMs and show that additional static user data can be incorporated easily to possibly enhance the labelling of users. Furthermore, we use prior knowledge to enhance generalization and avoid numerical problems. We use parameter tying to decrease the danger of overfitting and to reduce computational overhead. We put a flat prior on the parameters to deal with the problem that certain transitions between page categories occur very seldom or not at all, in order to ensure that a nonzero transition probability between these categories nonetheless remains. In applications to artificial data and real-world web logs we demonstrate the usefulness of our approach. We train a mixture of HMMs on artificial navigation patterns, and show that the correct model is being learned. Moreover, we show that the use of static ’satellite data’ may enhance the labeling of shorter navigation patterns. When applying a mixture of HMMs to real-world web logs from a large Dutch commercial web site, we demonstrate that sensible page categorizations are being learned.

Author supplied keywords

Cite

CITATION STYLE

APA

Ypma, A., Heskes, T., Zaïane, O., Srivastava, J., Spiliopoulou, M., & Masand, B. (2003). WEBKDD 2002 - Mining Web Data for Discovering Usage Patterns and Profiles. (O. R. Zaïane, J. Srivastava, M. Spiliopoulou, & B. Masand, Eds.) (Vol. 2703, pp. 35–49). Berlin, Heidelberg: Springer Berlin Heidelberg. Retrieved from http://www.springerlink.com/content/uxg4txcf3g9c0rjr/

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free