An intelligent system for web usage data preprocessing

Abstract

Web mining applies data mining techniques to very large web data repositories. Before web mining techniques can be applied, the data in the web log has to be preprocessed, integrated and transformed. Because the World Wide Web is growing continuously and rapidly, web miners need intelligent tools to find, extract, filter and evaluate the desired information. Data preprocessing is the most important phase of web mining, and it is both critical and complex for the successful extraction of useful data. The web log is incremental in nature, so conventional data preprocessing techniques, which assume that the data is static, have proved unsuitable. The web logs are also non-scalable, impractical and distributed in nature. Hence a comprehensive learning algorithm is required to obtain the desired information. This paper introduces an intelligent system capable of preprocessing web logs efficiently. It can intelligently distinguish accesses by human users from accesses by web search engines, in less time. The system reduces the error rate and significantly improves the learning performance of the learning algorithm. The work assesses the goodness of split using popular measures such as entropy and the Gini index. Experimental results supporting these claims are given in the paper. © Springer-Verlag Berlin Heidelberg 2011.
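The abstract names entropy and the Gini index as goodness-of-split measures but does not give the paper's exact formulation. The following is a minimal sketch using the standard textbook definitions, not the authors' implementation. The class labels "human" and "robot", the example split, and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    if not labels:
        return 0.0
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini index (impurity) of a list of class labels."""
    if not labels:
        return 0.0
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def split_gain(parent, groups, measure):
    """Impurity of the parent minus the size-weighted impurity of the groups.

    A larger value means the split separates the classes better.
    """
    total = sum(len(g) for g in groups)
    weighted = sum(len(g) / total * measure(g) for g in groups if g)
    return measure(parent) - weighted


# Hypothetical example: eight log sessions, half from human users and half
# from search engine robots, split by some assumed feature into two groups.
parent = ["human"] * 4 + ["robot"] * 4
pure_split = [["human"] * 4, ["robot"] * 4]
mixed_split = [["human", "human", "robot", "robot"],
               ["human", "human", "robot", "robot"]]

for name, groups in (("pure", pure_split), ("mixed", mixed_split)):
    print(name,
          split_gain(parent, groups, entropy),
          split_gain(parent, groups, gini))
```

Under these definitions, a split that isolates each class completely recovers all of the parent's impurity, while a split that leaves the class proportions unchanged gains nothing.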

Cite

Maheswara Rao, V. V. R., Valli Kumari, V., & Raju, K. V. S. V. N. (2011). An intelligent system for web usage data preprocessing. In Communications in Computer and Information Science (Vol. 131 CCIS, pp. 481–490). https://doi.org/10.1007/978-3-642-17857-3_47
