Estimation of minimum sample size for identification of the most important features: A case study providing a qualitative B2B sales data set

Marko Bohanec; Mirjana Kljajić Borštnar; Marko Robnik-Šikonja

Journal ArticleOPEN ACCESS

Estimation of minimum sample size for identification of the most important features: A case study providing a qualitative B2B sales data set

Croatian Operational Research Review (2017) 8(2) 515-524

DOI: 10.17535/crorr.2017.0033

2Citations

29Readers

Abstract

An important task in machine learning is to reduce data set dimensionality, which in turn contributes to reducing computational load and data collection costs, while improving human understanding and interpretation of models. We introduce an operational guideline for determining the minimum number of instances sufficient to identify correct ranks of features with the highest impact. We conduct tests based on qualitative B2B sales forecasting data. The results show that a relatively small instance subset is sufficient for identifying the most important features when rank is not important.

Author supplied keywords

Cite

CITATION STYLE

APA

Bohanec, M., Borštnar, M. K., & Robnik-Šikonja, M. (2017). Estimation of minimum sample size for identification of the most important features: A case study providing a qualitative B2B sales data set. Croatian Operational Research Review, 8(2), 515–524. https://doi.org/10.17535/crorr.2017.0033

Estimation of minimum sample size for identification of the most important features: A case study providing a qualitative B2B sales data set

Abstract

Author supplied keywords

Cite

Register to see more suggestions