Clustering is a powerful tool for knowledge discovery in text collections. The quality of document clustering depends not only on clustering algorithms but also on document representation models. We develop a hierarchical document clustering algorithm based on a tolerance rough set model (TRSM) for representing documents, which offers a way of considering semantics relatedness between documents. The results of validation and evaluation of this method suggest that this clustering algorithm can be well adapted to text mining.
CITATION STYLE
Kawasaki, S., Nguyen, N. B., & Ho, T. B. (2000). Hierarchical document clustering based on tolerance rough set model. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1910, pp. 458–463). Springer Verlag. https://doi.org/10.1007/3-540-45372-5_51
Mendeley helps you to discover research relevant for your work.