Feature selection is considered as one of the essential steps in data pre-processing. However, all of the previous studies on predicting PM10 concentration in Malaysia have been limited to statistical method feature selection, and none of these studies used machine-learning approaches. Therefore, the objective of this research is to investigate the influence variables of the PM10 prediction model by using wrapper feature selection to compare the prediction model performance of different wrapper feature selection and to predict the concentration of PM10 for the next day. This research uses 10 years of daily data on pollutant concentrations from two stations (Klang and Shah Alam) obtained from the Department of Environment Malaysia (DOE) from 2009 until 2018. Six wrapper methods (forward selection, backward elimination, stepwise, brute-force, weight-guided and genetic algorithm evolution and the predictive analytics multiple linear regression (MLR) and artificial neural network (ANN)) were implemented in this study. This study found that brute-force is the dominant wrapper method in most of the best models in selecting important features for MLR. Moreover, compared to MLR, ANN provides more advantages regarding model accuracy and permits feature selection in predicting PM10. The overall results revealed that the RMSE value for next day prediction in Klang is 20.728, while the AE value is 15.69. Furthermore, the RMSE value for next day prediction in Shah Alam is 10.004, while the AE value is 7.982. Finally, all of the predicted models in Klang and Shah Alam can be used to predict the PM10 concentrations. This proposed model can be used as a tool for an early warning system in giving air quality information to local authorities in order to formulate air-quality-improvement strategies.
CITATION STYLE
Ul-Saufie, A. Z., Hamzan, N. H., Zahari, Z., Shaziayani, W. N., Noor, N. M., Zainol, M. R. R. M. A., … Vizureanu, P. (2022). Improving Air Pollution Prediction Modelling Using Wrapper Feature Selection. Sustainability (Switzerland), 14(18). https://doi.org/10.3390/su141811403
Mendeley helps you to discover research relevant for your work.