Bootstrap rank-ordered conditional mutual information (broCMI): A nonlinear input variable selection method for water resources modeling

78Citations
Citations of this article
56Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The input variable selection problem has recently garnered much interest in the time series modeling community, especially within water resources applications, demonstrating that information theoretic (nonlinear)-based input variable selection algorithms such as partial mutual information (PMI) selection (PMIS) provide an improved representation of the modeled process when compared to linear alternatives such as partial correlation input selection (PCIS). PMIS is a popular algorithm for water resources modeling problems considering nonlinear input variable selection; however, this method requires the specification of two nonlinear regression models, each with parametric settings that greatly influence the selected input variables. Other attempts to develop input variable selection methods using conditional mutual information (CMI) (an analog to PMI) have been formulated under different parametric pretenses such as k nearest-neighbor (KNN) statistics or kernel density estimates (KDE). In this paper, we introduce a new input variable selection method based on CMI that uses a nonparametric multivariate continuous probability estimator based on Edgeworth approximations (EA). We improve the EA method by considering the uncertainty in the input variable selection procedure by introducing a bootstrap resampling procedure that uses rank statistics to order the selected input sets; we name our proposed method bootstrap rank-ordered CMI (broCMI). We demonstrate the superior performance of broCMI when compared to CMI-based alternatives (EA, KDE, and KNN), PMIS, and PCIS input variable selection algorithms on a set of seven synthetic test problems and a real-world urban water demand (UWD) forecasting experiment in Ottawa, Canada.

References Powered by Scopus

A Mathematical Theory of Communication

37376Citations
N/AReaders
Get full text

Elements of Information Theory

36728Citations
N/AReaders
Get full text

Extreme learning machine: Theory and applications

12105Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Machine learning algorithms for modeling groundwater level changes in agricultural regions of the U.S.

299Citations
N/AReaders
Get full text

Stream-flow forecasting using extreme learning machines: A case study in a semi-arid region in Iraq

274Citations
N/AReaders
Get full text

Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model

266Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Quilty, J., Adamowski, J., Khalil, B., & Rathinasamy, M. (2016). Bootstrap rank-ordered conditional mutual information (broCMI): A nonlinear input variable selection method for water resources modeling. Water Resources Research, 52(3), 2299–2326. https://doi.org/10.1002/2015WR016959

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 20

61%

Researcher 9

27%

Professor / Associate Prof. 4

12%

Readers' Discipline

Tooltip

Engineering 15

63%

Computer Science 3

13%

Earth and Planetary Sciences 3

13%

Agricultural and Biological Sciences 3

13%

Save time finding and organizing research with Mendeley

Sign up for free