Investigation and Imitation of Human Captains' Maneuver Using Inverse Reinforcement Learning

  • Higaki T
  • Hashimoto H
  • Yoshioka H
N/ACitations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

Automatic collision avoidance is of significant importance to prevent maritime collisions. Although many studies have been conducted in recent years, autonomous system has not completely replaced human captains since it is still difficult to imitate their complicated decisions. Thus, the present paper tries to investigate and imitate experienced captains' maneuver using maximum entropy inverse reinforcement learning (MaxEnt IRL). We firstly verify that MaxEnt IRL can reproduce appropriate reward function from demonstrative trajectories. Afterwards, we conduct an experiment on a simulator where well-experienced captains maneuver in congested sea and estimate reward from the trajectories. Searching the route which maximizes the obtained reward, finally, we demonstrate the optimized route can avoid collision against multiple ships in compliance with the International Regulations for Preventing Collisions at Sea (COLREGs).

Cite

CITATION STYLE

APA

Higaki, T., Hashimoto, H., & Yoshioka, H. (2022). Investigation and Imitation of Human Captains’ Maneuver Using Inverse Reinforcement Learning. Journal of the Japan Society of Naval Architects and Ocean Engineers, 36(0), 137–148. https://doi.org/10.2534/jjasnaoe.36.137

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free