Neuro-Symbolic Reinforcement Learning with First-Order Logic

Daiki Kimura; Masaki Ono; Subhajit Chaudhury; Ryosuke Kohita; Akifumi Wachi; Don Joven Agravante; Michiaki Tatsubori; Asim Munawar; Alexander Gray

Conference ProceedingsOPEN ACCESS

Neuro-Symbolic Reinforcement Learning with First-Order Logic

EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (2021) 3505-3511

DOI: 10.18653/v1/2021.emnlp-main.283

23Citations

99Readers

Abstract

Deep reinforcement learning (RL) methods often require many trials before convergence, and no direct interpretability of trained policies is provided. In order to achieve fast convergence and interpretability for the policy in RL, we propose a novel RL method for text-based games with a recent neuro-symbolic framework called Logical Neural Network, which can learn symbolic and interpretable rules in their differentiable network. The method is first to extract first-order logical facts from text observation and external word meaning network (ConceptNet), then train a policy in the network with directly interpretable logical operators. Our experimental results show RL training with the proposed method converges significantly faster than other state-of-the-art neuro-symbolic methods in a TextWorld benchmark.

Cite

CITATION STYLE

APA

Kimura, D., Ono, M., Chaudhury, S., Kohita, R., Wachi, A., Agravante, D. J., … Gray, A. (2021). Neuro-Symbolic Reinforcement Learning with First-Order Logic. In EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 3505–3511). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.emnlp-main.283

Neuro-Symbolic Reinforcement Learning with First-Order Logic

Abstract

Cite

Register to see more suggestions