Model-based offline reinforcement learning for sustainable fishery management

Abstract

Fisheries, as indispensable natural resources for humans, need to be managed with both short-term economic benefits and long-term sustainability in mind. This remains a challenge because the population and catch dynamics of fisheries are complex and noisy, while the available data are often scarce and provide only partial information on the dynamics. To address these challenges, we formulate the population and catch dynamics as a Partially Observable Markov Decision Process (POMDP) and propose a model-based offline reinforcement learning approach to learn an optimal management policy. Our approach allows learning fishery management policies from possibly incomplete fishery data generated by a stochastic fishery system: we first learn a POMDP fishery model using a novel least squares approach, and then compute the optimal policy for the learned POMDP. The learned fishery dynamics model is also useful for explaining the resulting policy's performance. We perform a systematic and comprehensive simulation study to quantify the effects of stochasticity in fishery dynamics, proliferation rates, missing values in fishery data, dynamics model misspecification, and variability of effort (e.g., the number of boat days). When the effort is sufficiently variable and the noise is moderate, our method produces a competitive policy that achieves 85% of the optimal value, even in the hardest case of noisy, incomplete data and a misspecified model. Interestingly, the learned policies appear robust to model learning errors. However, non-identifiability arises when there is insufficient variability in the effort level and the fishery system is stochastic; this often results in poor policies, highlighting the need for sufficiently informative data. We also provide a theoretical analysis of model misspecification and discuss the tendency of a Schaefer model to overfit compared with a Beverton–Holt model.
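The two population models named in the abstract (Schaefer and Beverton–Holt) can be sketched as simple one-step biomass update rules. The parameter names, default values, and the effort-proportional catch term below are illustrative assumptions for exposition, not the paper's exact parameterization:

```python
def schaefer_step(b, effort, r=0.8, K=1.0, q=0.5):
    """One step of a deterministic Schaefer surplus-production model.

    b: current biomass, effort: fishing effort (e.g., boat days, scaled);
    catch is assumed proportional to both effort and biomass.
    """
    catch = q * effort * b
    b_next = b + r * b * (1.0 - b / K) - catch  # logistic growth minus catch
    return max(b_next, 0.0), catch

def beverton_holt_step(b, effort, r0=2.0, M=0.5, q=0.5):
    """One step of a Beverton-Holt recruitment model with the same
    effort-proportional catch; recruitment saturates as escapement grows.
    """
    catch = q * effort * b
    escapement = max(b - catch, 0.0)          # biomass surviving the harvest
    b_next = r0 * escapement / (1.0 + escapement / M)
    return b_next, catch
```

The saturating Beverton–Holt recruitment curve has fewer degrees of freedom near equilibrium than the Schaefer parabola, which is one intuition for the overfitting tendency discussed in the abstract.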

Cite

APA

Ju, J., Kurniawati, H., Kroese, D., & Ye, N. (2023). Model-based offline reinforcement learning for sustainable fishery management. Expert Systems. https://doi.org/10.1111/exsy.13324
