Nowadays, it is often required in modern condition monitoring applications, to describe acquired signal by set of parameters. It directly leads to mD diagnostic data. Before starting the proper analysis of the recorded data, it is advisable to look at the data globally to get an idea what really they are representing. Visualization of mD data is a challenging problem and probably it is not possible to find an ideal method that could take into account all aspects in case of high dimensional, nonlinear, redundant, etc., data. We propose to use for that goal jointly the triplet multivariate visualization methods: Self-organizing maps, Parallel coordinate plots and t-distributed Stochastic neighbor embedding. The methods use concepts of Machine Learning, simple Geometry and Probabilistic Modeling for finding indices of distances or similarities between the data vectors represented in the multivariate data space as data points. The methods permit to visualize the data points in a plane with possibly preserving their mutual between-point distances in the multidimensional data space. The three proposed methods are complementary, and they are supplementing each other. The considerations are illustrated using a data matrix X of size (1000 × 15) containing gearbox diagnostic data structured into 4 (sub)groups. Indeed, the three applied (unsupervised) methods permit to get an insight into the 15-dimensional data space and to state that data points belonging to different subgroups of X have different geometrical location. However, the employed methods do not yield indications for reducing the dimensionality (number of variables) of the considered data.
CITATION STYLE
Bartkowiak, A. M., & Zimroz, R. (2018). Complementary view on multivariate data structure based on kohonen’s SOM, parallel coordinates and t-SNE methods. In Applied Condition Monitoring (Vol. 9, pp. 255–265). Springer. https://doi.org/10.1007/978-3-319-61927-9_24
Mendeley helps you to discover research relevant for your work.