Policy gradient as a proxy for dynamic oracles in constituency parsing

Daniel Fried; Dan Klein

Conference ProceedingsOPEN ACCESS

Policy gradient as a proxy for dynamic oracles in constituency parsing

ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (2018) 2 469-476

DOI: 10.18653/v1/p18-2075

18Citations

100Readers

Abstract

Dynamic oracles provide strong supervision for training constituency parsers with exploration, but must be custom defined for a given parser’s transition system. We explore using a policy gradient method as a parser-agnostic alternative. In addition to directly optimizing for a tree-level metric such as F1, policy gradient has the potential to reduce exposure bias by allowing exploration during training; moreover, it does not require a dynamic oracle for supervision. On four constituency parsers in three languages, the method substantially outperforms static oracle likelihood training in almost all settings. For parsers where a dynamic oracle is available (including a novel oracle which we define for the transition system of Dyer et al. (2016)), policy gradient typically recaptures a substantial fraction of the performance gain afforded by the dynamic oracle.

Cite

CITATION STYLE

APA

Fried, D., & Klein, D. (2018). Policy gradient as a proxy for dynamic oracles in constituency parsing. In ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers) (Vol. 2, pp. 469–476). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p18-2075

Policy gradient as a proxy for dynamic oracles in constituency parsing

Abstract

Cite

Register to see more suggestions