Abstract
We describe the R package rmcfs that implements an algorithm for ranking features from high dimensional data according to their importance for a given supervised classification task. The ranking is performed prior to addressing the classification task per se. This R package is the new and extended version of the MCFS (Monte Carlo feature selection) algorithm where an early version was published in 2005. The package provides an easy R interface, a set of tools to review results and the new ID (interdependency discovery) component. The algorithm can be used on continuous and/or categorical features (e.g., gene expression and phenotypic data) to produce an objective ranking of features with a statistically well-defined cutoff between informative and non-informative ones. Moreover, the directed ID graph that presents interdependencies between informative features is provided.
Author supplied keywords
Cite
CITATION STYLE
Draminski, M., & Koronacki, J. (2018). Rmcfs: An R package for monte carlo feature selection and interdependency discovery. Journal of Statistical Software, 85. https://doi.org/10.18637/jss.v085.i12
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.