Ambiguity is a challenge faced by systems that handle natural language. To assuage the issue of linguistic ambiguities found in text classification, this work proposes a text categorizer using the methodology of Fuzzy Similarity. The clustering algorithms Stars and Cliques are adopted in the Agglomerative Hierarchical method and they identify the groups of texts by specifying some type of relationship rule to create categories based on the similarity analysis of the textual terms. The proposal is based on the methodology suggested, categories can be created from the analysis of the degree of similarity of the texts to be classified, without needing to determine the number of initial categories. The combination of techniques proposed in the categorizer’s steps brought satisfactory results, proving to be efficient in textual classification.
CITATION STYLE
Guelpeli, M. V. C., Garcia, A. C. B., & Bernardini, F. C. (2010). An analysis of constructed categories for textual classification using fuzzy similarity and agglomerative hierarchical methods. In Advanced Information and Knowledge Processing (Vol. 52, pp. 277–306). Springer-Verlag London Ltd. https://doi.org/10.1007/978-1-84996-077-9_11
Mendeley helps you to discover research relevant for your work.