A Hybrid Regression Model for Mixed Numerical and Categorical Data

Nouf Alghanmi; Xiao Jun Zeng

Conference Proceedings

A Hybrid Regression Model for Mixed Numerical and Categorical Data

Advances in Intelligent Systems and Computing (2020) 1043 369-376

DOI: 10.1007/978-3-030-29933-0_31

1Citations

2Readers

Get full text

Abstract

It is noticeable in different heterogeneity types that complexity is inherent in heterogeneous data, and regression analysis methods are well defined and exhibit high-accuracy performance with numeric data. However, real-world problems contain non-numerical variables. There are two main approaches to handling mixed-type data sets in regression analyses. The first approach is unifying data types for all the variables (such as continuous numerical data) and then applying the regression analysis. However, this approach degrades the data quality, as some original data types are converted to other types in the learning stage. The second approach is to apply some similarity measurements, which can be highly complex in some situations. To overcome these limitations, we propose a tree-based regression model to effectively handle the mixed-type data sets without using a dummy code or a similarity measurement.

Author supplied keywords

Cite

CITATION STYLE

APA

Alghanmi, N., & Zeng, X. J. (2020). A Hybrid Regression Model for Mixed Numerical and Categorical Data. In Advances in Intelligent Systems and Computing (Vol. 1043, pp. 369–376). Springer Verlag. https://doi.org/10.1007/978-3-030-29933-0_31

A Hybrid Regression Model for Mixed Numerical and Categorical Data

Abstract

Author supplied keywords

Cite

Register to see more suggestions