In this paper, we study fundamental properties of the Self-Organizing Map (SOM) and the Generative Topographic Mapping (GTM), ramifications of the initialization of the algorithms and properties of the algorithms in presence of missing data. We show that the commonly used principal component analysis (PCA) initialization of the GTM does not guarantee good learning results with complex, high-dimensional data. We propose initializing the GTM with SOM and demonstrate usefulness of this improvement using the ISOLET data set. We also propose a revision to the batch SOM algorithm called the Imputation SOM and show that the new algorithm is more robust in presence of missing data. We compare the performance of the algorithms in the missing value imputation task. We also announce a revised version of the SOM Toolbox for Matlab with added GTM functionality. © 2013 Springer-Verlag.
CITATION STYLE
Vatanen, T., Nieminen, I. T., Honkela, T., Raiko, T., & Lagus, K. (2013). Controlling self-organization and handling missing values in SOM and GTM. In Advances in Intelligent Systems and Computing (Vol. 198 AISC, pp. 55–64). Springer Verlag. https://doi.org/10.1007/978-3-642-35230-0_6
Mendeley helps you to discover research relevant for your work.