Most existing machine learning techniques can handle objects described by real-valued features but not by categorical ones. In this paper we introduce a simple unsupervised method for transforming categorical feature values into real ones. It is based on low-rank approximations of collaborative feature value frequencies. Once object descriptions are transformed, any common real-valued machine learning technique can be applied for further data analysis. For example, it becomes possible to apply the classic and powerful Random Forest predictor to supervised learning problems. Our experiments show that combining the proposed feature transformation method with common real-valued supervised algorithms yields results comparable to those of state-of-the-art approaches such as Factorization Machines.
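The abstract does not spell out how the frequencies are built or factorized, so the following is only a minimal sketch of the general idea under stated assumptions: "collaborative frequencies" are read here as co-occurrence counts between the values of two categorical features, and the low-rank approximation is taken to be a truncated SVD. The function names `cooccurrence_counts`, `low_rank_embeddings`, and `transform` are hypothetical and are not taken from the paper.

```python
import numpy as np


def cooccurrence_counts(a, b):
    """Count how often each value of feature `a` appears with each value of `b`.

    Assumption: this count matrix stands in for the paper's
    "collaborative feature value frequencies".
    """
    a_vals, a_idx = np.unique(np.asarray(a), return_inverse=True)
    b_vals, b_idx = np.unique(np.asarray(b), return_inverse=True)
    counts = np.zeros((len(a_vals), len(b_vals)))
    # np.add.at accumulates correctly when the same (row, col) pair repeats.
    np.add.at(counts, (a_idx, b_idx), 1.0)
    return counts, a_idx, b_idx


def low_rank_embeddings(counts, rank):
    """Truncated SVD of the count matrix (an assumed choice of factorization).

    Returns one real-valued vector per value of each feature, such that
    emb_a @ emb_b.T is the best rank-`rank` approximation of `counts`.
    """
    u, s, vt = np.linalg.svd(counts, full_matrices=False)
    k = min(rank, len(s))
    emb_a = u[:, :k] * s[:k]   # rows: values of feature a
    emb_b = vt[:k, :].T        # rows: values of feature b
    return emb_a, emb_b


def transform(a, b, rank=2):
    """Replace each categorical value by its embedding and concatenate."""
    counts, a_idx, b_idx = cooccurrence_counts(a, b)
    emb_a, emb_b = low_rank_embeddings(counts, rank)
    return np.hstack([emb_a[a_idx], emb_b[b_idx]])


if __name__ == "__main__":
    color = ["red", "red", "blue", "green", "blue"]
    size = ["s", "m", "s", "l", "m"]
    X = transform(color, size, rank=2)
    print(X.shape)  # one real-valued row per object
```

The resulting matrix `X` contains only real values, so it could be passed to any real-valued learner, for instance a random forest implementation. With more than two categorical features, one plausible extension (again an assumption, not the paper's stated procedure) is to repeat this for each pair of features and concatenate the embeddings.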
Fonarev, A. (2015). Transformation of categorical features into real using low-rank approximations. In Communications in Computer and Information Science (Vol. 505, pp. 253–262). Springer Verlag. https://doi.org/10.1007/978-3-319-25485-2_7