a new coincept-based similarty measyre wich makes use of the concept analysis on the sencente, document, corpus levels and on web documents is proiposed. This similarity measure out performs other similarity measires that are based on therm analysis models fo the document only. The similarity between documents is based on a combination of sencence .- based, document-based corpus-based and web documenbt - based concept analysis Similartyu based on matching od concepts between document pairs, is shown to have a more significant effect on the clustering qualoity due to the similarity¡s insensitivity to noisy terms that can lead to an incorrect similarity. The concept are less sensitive to noise when it comes to calculating document similarty. We can efficiently find significant matching concepts betwen web documents, according to the semantics of their sencentes present in the web document.
CITATION STYLE
Divya, J., & Kusuma, S. (2012). Text Clustering for web documewnts Using Concept - Based Minig Model. International Journal of Emerging Trends in Engineering and Development, 1.6(2). Retrieved from http://rspublication.com/ijeted/ijeted sep 12/1.pdf
Mendeley helps you to discover research relevant for your work.