Improved Feature Selection Based on Mutual Information for Regression Tasks

  • Sulaiman, M. A.
  • Labadin, J.
Abstract

Mutual Information (MI) is an information-theoretic concept that has been widely used in recent years as a criterion for feature selection methods, owing to its ability to capture both linear and non-linear dependency relationships between two variables. In theory, mutual information is formulated in terms of the probability density functions (pdfs), or equivalently the entropies, of the two variables. In most machine learning applications, mutual information estimation is formulated for classification problems (that is, data with labeled output). This study investigates the use of mutual information estimation as a feature selection criterion for regression tasks and introduces enhancements to the selection of an optimal feature subset based on previous work. Specifically, while focusing on regression tasks, it builds on earlier work in which a scientifically sound stopping criterion for greedy feature selection algorithms was proposed. Four real-world regression datasets were used in this study: three are public datasets obtained from the UCI Machine Learning Repository, and the remaining one is a private well-log dataset. Two machine learning models, namely multiple regression and artificial neural networks (ANN), were used to test the performance of the proposed method, Improved Feature Selection based on Mutual Information for Regression (IFSMIR). The results obtained demonstrate the effectiveness of the proposed method.
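For reference, the standard identity the abstract alludes to (a textbook definition, not notation taken from the paper itself): for continuous variables $X$ and $Y$ with joint density $p(x,y)$ and marginals $p(x)$, $p(y)$, mutual information can be written in both the pdf-based and the entropy-based form,

$$
I(X;Y) \;=\; \iint p(x,y)\,\log\frac{p(x,y)}{p(x)\,p(y)}\,dx\,dy \;=\; H(X) + H(Y) - H(X,Y),
$$

where $H(\cdot)$ denotes (differential) entropy. In practice these quantities must be estimated from samples, which is what makes MI-based selection for continuous regression targets non-trivial.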
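As an illustrative sketch only, and not the authors' IFSMIR algorithm (whose ranking criterion and stopping rule are defined in the paper), a greedy, MI-driven forward selection loop for a regression target might look like the following; it uses scikit-learn's mutual_info_regression estimator and an mRMR-style relevance-minus-redundancy score, both of which are assumptions chosen here for illustration:

```python
# Minimal sketch of greedy forward feature selection driven by mutual
# information for a regression target. This illustrates the general
# technique, NOT the IFSMIR method from the paper; the mRMR-style score
# and the fixed feature budget are simplifying assumptions.
import numpy as np
from sklearn.feature_selection import mutual_info_regression

def greedy_mi_selection(X, y, n_features):
    """Greedily pick features with high estimated MI with y, penalized
    by their average MI with already-selected features."""
    selected, remaining = [], list(range(X.shape[1]))
    relevance = mutual_info_regression(X, y)  # MI of each feature with target
    while remaining and len(selected) < n_features:
        best, best_score = None, -np.inf
        for j in remaining:
            # Redundancy: mean MI between candidate j and selected features.
            if selected:
                redundancy = mutual_info_regression(X[:, selected], X[:, j]).mean()
            else:
                redundancy = 0.0
            score = relevance[j] - redundancy  # mRMR-style criterion
            if score > best_score:
                best, best_score = j, score
        selected.append(best)
        remaining.remove(best)
    return selected
```

A call such as greedy_mi_selection(X_train, y_train, n_features=5) would return the indices of the chosen columns; the paper's contribution is, in part, replacing the arbitrary n_features budget with a principled stopping criterion.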

Citation (APA)

Sulaiman, M. A., & Labadin, J. (2016). Improved Feature Selection Based on Mutual Information for Regression Tasks. Journal of IT in Asia, 6(1), 11–24. https://doi.org/10.33736/jita.330.2016
