Improved transition-based parsing by modeling characters instead of words with LSTMs

Miguel Ballesteros; Chris Dyer; Noah A. Smith

Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (2015) 349-359

DOI: 10.18653/v1/d15-1041

Abstract

We present extensions to a continuous-state dependency parsing method that makes it applicable to morphologically rich languages. Starting with a high-performance transition-based parser that uses long short-term memory (LSTM) recurrent neural networks to learn representations of the parser state, we replace lookup-based word representations with representations constructed from the orthographic representations of the words, also using LSTMs. This allows statistical sharing across word forms that are similar on the surface. Experiments for morphologically rich languages show that the parsing model benefits from incorporating the character-based encodings of words.
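
The central idea of the abstract, building a word's vector from its characters rather than looking it up in a table, can be illustrated with a small sketch. The code below is not the authors' implementation: the names `TinyLSTM` and `CharWordEncoder`, the dimensions, the random untrained weights, and the plain concatenation of the forward and backward final states are all assumptions made for illustration. It only shows how a bidirectional character LSTM can produce a fixed-size vector for any word form, including forms never seen before.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyLSTM:
    """Single-layer LSTM with random, untrained weights (illustrative only)."""

    def __init__(self, input_dim, hidden_dim, rng):
        self.hidden_dim = hidden_dim
        cols = input_dim + hidden_dim
        # One weight matrix and bias per gate:
        # i = input, f = forget, o = output, c = candidate cell value.
        self.W = {
            g: [[rng.uniform(-0.1, 0.1) for _ in range(cols)]
                for _ in range(hidden_dim)]
            for g in "ifoc"
        }
        self.b = {g: [0.0] * hidden_dim for g in "ifoc"}

    def _affine(self, gate, xh):
        return [
            sum(w * v for w, v in zip(row, xh)) + bias
            for row, bias in zip(self.W[gate], self.b[gate])
        ]

    def run(self, inputs):
        """Consume a sequence of vectors and return the final hidden state."""
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        for x in inputs:
            xh = x + h  # concatenate the input with the previous hidden state
            i = [sigmoid(v) for v in self._affine("i", xh)]
            f = [sigmoid(v) for v in self._affine("f", xh)]
            o = [sigmoid(v) for v in self._affine("o", xh)]
            g = [math.tanh(v) for v in self._affine("c", xh)]
            c = [fv * cv + iv * gv for fv, cv, iv, gv in zip(f, c, i, g)]
            h = [ov * math.tanh(cv) for ov, cv in zip(o, c)]
        return h


class CharWordEncoder:
    """Hypothetical encoder: word vector from a bidirectional character LSTM."""

    def __init__(self, alphabet, char_dim=8, hidden_dim=16, seed=0):
        rng = random.Random(seed)
        self.char_vecs = {
            ch: [rng.uniform(-0.1, 0.1) for _ in range(char_dim)]
            for ch in alphabet
        }
        self.unknown = [0.0] * char_dim  # fallback for unseen characters
        self.forward = TinyLSTM(char_dim, hidden_dim, rng)
        self.backward = TinyLSTM(char_dim, hidden_dim, rng)

    def encode(self, word):
        chars = [self.char_vecs.get(ch, self.unknown) for ch in word]
        # Read the characters left-to-right and right-to-left, then
        # concatenate the two final states (a simplifying assumption).
        return self.forward.run(chars) + self.backward.run(chars[::-1])


encoder = CharWordEncoder("abcdefghijklmnopqrstuvwxyz")
vector = encoder.encode("parsing")
print(len(vector))  # forward and backward states concatenated
```

Because the vector is computed from characters, word forms that share a stem or suffix pass through the same parameters, which is the kind of statistical sharing across surface-similar forms that the abstract describes. In a trained parser these weights would be learned jointly with the rest of the model rather than drawn at random.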

Cite

Ballesteros, M., Dyer, C., & Smith, N. A. (2015). Improved transition-based parsing by modeling characters instead of words with LSTMs. In Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (pp. 349–359). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d15-1041
