Importance of land use factors in the prediction of water quality of the Upper Green River watershed, Kentucky, USA, using random forest

6Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Surface waters are essential for meeting the needs of the world. In many regions, stream water quality is a major concern due to contamination from multiple sources. Stream water is also susceptible to climatic events and land-use practices influencing its catchment. Understanding the impact of such events on stream water quality is crucial for managing and protecting aquatic ecosystems and providing safe drinking water to communities that rely on these streams. Hence, monitoring and evaluating stream water quality holds significance in identifying potential hazards and implementing suitable management strategies. In this paper, a novel effort was made to determine the relative feature importance of a set of watershed characteristics (precipitation, temperature, urban land use, agricultural land use, and forest land-use factors) on four important water quality parameters (WQPs): fecal coliforms (FC), turbidity, pH, and conductivity of the Upper Green River watershed, Kentucky, USA. Random forest (RF), an ensemble learning method, was used to predict the WQPs from the causal parameters and determine the feature importance characteristics of the four WQPs previously mentioned. This model demonstrated that precipitation and temperature are the most influential factors on FC, turbidity, and pH. Forest land use and temperature are the two most important factors for conductivity. The novel feature importance factors of the RF model have likewise been confirmed for each WQP. In modeling stream WQPs, the developed the RF model outperformed the artificial neural network (ANN) model. Using the RF model, we obtain regression coefficients of (0.93, 0.74, and 0.94) for pH in training, testing, and overall. We obtain regression coefficients of (0.60, 0.64, and 0.61) using the ANN model. ⁠⁠⁠⁠⁠⁠⁠Overall, the RF model was more effective than the ANN model in modeling stream WQPs. The model identified precipitation and temperature as the most influential factors on FC, turbidity, and pH, while forest land use and temperature were the most important factors in determining conductivity. It is also found that land use factors are important to improve the accuracy of WQPs predictions from climate variables. The results of this study can be used by authorities to better understand and control pollution at the watershed scale.

Cite

CITATION STYLE

APA

Venkateswarlu, T., & Anmala, J. (2024). Importance of land use factors in the prediction of water quality of the Upper Green River watershed, Kentucky, USA, using random forest. Environment, Development and Sustainability, 26(9), 23961–23984. https://doi.org/10.1007/s10668-023-03630-1

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free