An intelligent system for web usage data preprocessing

V. V.R. Maheswara Rao; V. Valli Kumari; K. V.S.V.N. Raju

Conference Proceedings

An intelligent system for web usage data preprocessing

Communications in Computer and Information Science (2011) 131 CCIS(PART 1) 481-490

DOI: 10.1007/978-3-642-17857-3_47

3Citations

4Readers

Get full text

Abstract

Web mining is an application of data mining technologies for huge data repositories. Before applying web mining techniques, the data in the web log has to be pre-processed, integrated and transformed. As the World Wide Web is continuously and rapidly growing, it is necessary for the web miners to utilize intelligent tools in order to find, extract, filter and evaluate the desired information. The data preprocessing stage is the most important phase in the process of web mining and is critical and complex in successful extraction of useful data. The web log is incremental in nature, thus conventional data preprocessing techniques were proved to be not suitable as they assume that the data is static. The web logs are non scalable, impractical and are distributed in nature. Hence we require a comprehensive learning algorithm in order to get the desired information. This paper introduces an intelligent system, capable of preprocessing web logs efficiently. It can identify human user and web search engine accesses intelligently, in less time. The system discussed reduces the error rate and improves significant learning performance of the learning algorithm. The work ensures the goodness of split by using popular measures like Entropy and Gini index. The experimental results proving this claim are given in this paper. © Springer-Verlag Berlin Heidelberg 2011.

Author supplied keywords

Cite

CITATION STYLE

APA

Maheswara Rao, V. V. R., Valli Kumari, V., & Raju, K. V. S. V. N. (2011). An intelligent system for web usage data preprocessing. In Communications in Computer and Information Science (Vol. 131 CCIS, pp. 481–490). https://doi.org/10.1007/978-3-642-17857-3_47

An intelligent system for web usage data preprocessing

Abstract

Author supplied keywords

Cite

Register to see more suggestions