Creation of Text Document Matrices and Visualization by Self-Organizing Map

  • Stefanovič P
  • Kurasova O

Abstract

In this paper, text mining and visualization by self-organizing map (SOM) are investigated. First, textual information must be converted into numerical form, and the results of text mining and visualization depend on this conversion. The paper therefore investigates how two control factors applied when the document dictionary is created (the common word list and the use of a stemming algorithm) influence the text mining results. A self-organizing map is used for text clustering and graphical representation (visualization). A comparative analysis is carried out on a dataset of scientific papers about optimization based on Pareto, simplex, and genetic algorithms. Two new measures are also proposed to estimate SOM quality when classified data are analyzed: the distances between SOM cells corresponding to data items assigned to the same class, and the distance between the centers of SOM cells corresponding to different classes. The quantization error is also measured to estimate SOM quality. DOI: http://dx.doi.org/10.5755/j01.itc.43.1.4299
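To make the conversion step concrete, the following is a minimal Python sketch of how a text document matrix might be built with a common word list and optional stemming, plus the usual quantization error (mean distance from each data vector to its nearest SOM codebook vector). It is not the authors' implementation: the `COMMON_WORDS` list, the `crude_stem` suffix stripper (a stand-in for a real stemming algorithm), the two sample documents, and all function names are assumptions made for illustration.

```python
import math
import re
from collections import Counter

# Hypothetical common-word list; the list used in the paper is not given here.
COMMON_WORDS = {"the", "a", "of", "and", "is", "in", "to", "for"}


def crude_stem(word):
    """Strip a few English suffixes. A stand-in for a real stemming algorithm."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def tokenize(text, use_stemming=True):
    """Lowercase, split into words, drop common words, optionally stem."""
    words = re.findall(r"[a-z]+", text.lower())
    words = [w for w in words if w not in COMMON_WORDS]
    return [crude_stem(w) for w in words] if use_stemming else words


def document_matrix(docs, use_stemming=True):
    """Return (dictionary, matrix); matrix[i][j] counts term j in document i."""
    counts = [Counter(tokenize(d, use_stemming)) for d in docs]
    dictionary = sorted(set().union(*counts))
    matrix = [[c[term] for term in dictionary] for c in counts]
    return dictionary, matrix


def quantization_error(matrix, codebook):
    """Mean Euclidean distance from each row to its nearest codebook vector."""
    total = 0.0
    for row in matrix:
        total += min(math.dist(row, unit) for unit in codebook)
    return total / len(matrix)


# Illustrative documents, not taken from the paper's dataset.
docs = [
    "Genetic algorithms for optimization",
    "Optimizing with the simplex algorithm",
]
dictionary, matrix = document_matrix(docs)
```

Calling `document_matrix(docs, use_stemming=False)` on the same input yields a larger dictionary, because "algorithm" and "algorithms" are then kept as separate terms. This is the kind of effect of the control factors that the abstract refers to. In practice, `codebook` would hold the weight vectors of a trained SOM; training is omitted from this sketch.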

Cite

Stefanovič, P., & Kurasova, O. (2014). Creation of Text Document Matrices and Visualization by Self-Organizing Map. Information Technology and Control, 43(1), 37–46. https://doi.org/10.5755/j01.itc.43.1.4299
