Developing an ensembled machine learning prediction model for marine fish and aquaculture production

13Citations
Citations of this article
63Readers
Mendeley users who have this article in their library.

Abstract

The fishing industry is identified as a strategic sector to raise domestic protein production and supply in Malaysia. Global changes in climatic variables have impacted and continue to impact marine fish and aquaculture production, where machine learning (ML) methods are yet to be extensively used to study aquatic systems in Malaysia. ML-based algorithms could be paired with feature importance, i.e., (features that have the most predictive power) to achieve better prediction accuracy and can provide new insights on fish production. This research aims to develop an MLbased prediction of marine fish and aquaculture production. Based on the feature importance scores, we select the group of climatic variables for three different ML models: linear, gradient boosting, and random forest regression. The past 20 years (2000–2019) of climatic variables and fish production data were used to train and test the ML models. Finally, an ensemble approach named voting regression combines those three ML models. Performance matrices are generated and the results showed that the ensembled ML model obtains R2 values of 0.75, 0.81, and 0.55 for marine water, freshwater, and brackish water, respectively, which outperforms the single ML model in predicting all three types of fish production (in tons) in Malaysia.

Cite

CITATION STYLE

APA

Rahman, L. F., Marufuzzaman, M., Alam, L., Bari, M. A., Sumaila, U. R., & Sidek, L. M. (2021). Developing an ensembled machine learning prediction model for marine fish and aquaculture production. Sustainability (Switzerland), 13(16). https://doi.org/10.3390/su13169124

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free