Featuremaximization (F-max) is an unbiased quality estimationmetric of unsupervised classification (clustering) that favours clusters with a maximal feature F-measure value. In this article we show that an adaptation of this metric within the framework of supervised classification allows efficient feature selection and feature contrasting to be performed.We experiment the method on different types of textual data. In this context, we demonstrate that this technique significantly improves the performance of classification methods as compared with the use of state-of-the art feature selection techniques, notably in the case of the classification of unbalanced, highly multidimensional and noisy textual data gathered in similar classes.
CITATION STYLE
Lamirel, J. C., Cuxac, P., & Hajlaoui, K. (2017). A novel approach to feature selection based on quality estimation metrics. In Studies in Computational Intelligence (Vol. 665, pp. 121–140). Springer Verlag. https://doi.org/10.1007/978-3-319-45763-5_7
Mendeley helps you to discover research relevant for your work.