Importance of land use factors in the prediction of water quality of the Upper Green River watershed, Kentucky, USA, using random forest

9Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Surface waters are essential for meeting the needs of the world. In many regions, stream water quality is a major concern due to contamination from multiple sources. Stream water is also susceptible to climatic events and land-use practices influencing its catchment. Understanding the impact of such events on stream water quality is crucial for managing and protecting aquatic ecosystems and providing safe drinking water to communities that rely on these streams. Hence, monitoring and evaluating stream water quality holds significance in identifying potential hazards and implementing suitable management strategies. In this paper, a novel effort was made to determine the relative feature importance of a set of watershed characteristics (precipitation, temperature, urban land use, agricultural land use, and forest land-use factors) on four important water quality parameters (WQPs): fecal coliforms (FC), turbidity, pH, and conductivity of the Upper Green River watershed, Kentucky, USA. Random forest (RF), an ensemble learning method, was used to predict the WQPs from the causal parameters and determine the feature importance characteristics of the four WQPs previously mentioned. This model demonstrated that precipitation and temperature are the most influential factors on FC, turbidity, and pH. Forest land use and temperature are the two most important factors for conductivity. The novel feature importance factors of the RF model have likewise been confirmed for each WQP. In modeling stream WQPs, the developed the RF model outperformed the artificial neural network (ANN) model. Using the RF model, we obtain regression coefficients of (0.93, 0.74, and 0.94) for pH in training, testing, and overall. We obtain regression coefficients of (0.60, 0.64, and 0.61) using the ANN model. ⁠⁠⁠⁠⁠⁠⁠Overall, the RF model was more effective than the ANN model in modeling stream WQPs. The model identified precipitation and temperature as the most influential factors on FC, turbidity, and pH, while forest land use and temperature were the most important factors in determining conductivity. It is also found that land use factors are important to improve the accuracy of WQPs predictions from climate variables. The results of this study can be used by authorities to better understand and control pollution at the watershed scale.

References Powered by Scopus

Random forests

96941Citations
N/AReaders
Get full text

Ecological perspective on water quality goals

704Citations
N/AReaders
Get full text

Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data

392Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Modelling relationships between land use and water quality using statistical methods: A critical and applied review

9Citations
N/AReaders
Get full text

Accurate Forecast of Water Quality Index for Cholera Diseases using Two-Layered Stacked Machine Learning Algorithms

8Citations
N/AReaders
Get full text

Spatio-temporal variability of turbidity derived from Sentinel-2 in Reloncaví sound, Northern Patagonia, Chile

3Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Venkateswarlu, T., & Anmala, J. (2024). Importance of land use factors in the prediction of water quality of the Upper Green River watershed, Kentucky, USA, using random forest. Environment, Development and Sustainability, 26(9), 23961–23984. https://doi.org/10.1007/s10668-023-03630-1

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

43%

Researcher 3

43%

Lecturer / Post doc 1

14%

Readers' Discipline

Tooltip

Environmental Science 3

50%

Computer Science 2

33%

Engineering 1

17%

Save time finding and organizing research with Mendeley

Sign up for free