New Features of Categorical Principal Components Analysis for Complicated Data Sets, Including Data Mining

J. J. Meulman; A. J. van der Kooij; A. Babinec

Book Chapter

New Features of Categorical Principal Components Analysis for Complicated Data Sets, Including Data Mining

Meulman J
van der Kooij A
Babinec A

DOI: 10.1007/978-3-642-55991-4_22

N/ACitations

8Readers

Get full text

Abstract

This paper focuses on a technique to perform principal components analysis with nonlinear scaling of variables, and having correspondence analysis features. Special attention will be given to particular properties that make the technique suited for data mining. In addition to fitting of points for individual objects or subjects, additional points may be fitted to identify groups among them. There is a large emphasis on graphical display of the results in biplots (with variables and objects) and triplots (with variables, objects, and groups). The information contained in the biplots and triplots is used to draw special graphs that identify particular groups in the data that stand out on selected variables. Supplementary variables and objects may be used to link different data sets in a single representation. When a fixed configuration of points is given, the technique may be used for property fitting, i.e., fitting external information into the space. The method can be used to analyze very large data sets by assuming that the variables are categorical; when, however, continuous variables are available as well, these can be made discrete by various optimal procedures. Ordered (ordinal) and non-ordered (nominal) data can be handled by the use of monotonic or non-monotonic (spline) transformations. A state-of-the-art computer program (called CATPCA) is available from SPSS Categories 10.0 onwards.

Cite

CITATION STYLE

APA

Meulman, J. J., van der Kooij, A. J., & Babinec, A. (2002). New Features of Categorical Principal Components Analysis for Complicated Data Sets, Including Data Mining (pp. 207–217). https://doi.org/10.1007/978-3-642-55991-4_22

New Features of Categorical Principal Components Analysis for Complicated Data Sets, Including Data Mining

Abstract

Cite

Register to see more suggestions