Abstract
Recently, neural models have shown superior performance over conventional models on NER tasks. These models use a CNN to extract sub-word information and an RNN to predict a tag for each word. However, they have been tested almost exclusively on English texts, and it remains unclear whether they perform similarly in other languages. We applied neural models to Japanese NER and identified two obstacles for the state-of-the-art model. First, CNNs are unsuitable for extracting Japanese sub-word information. Second, a model that predicts a tag for each word cannot extract an entity when only part of a word composes the entity. The contributions of this work are (i) verifying the effectiveness of the state-of-the-art NER model for Japanese, and (ii) proposing a neural model that predicts a tag for each character using both word and character information. Experimental results demonstrate that our model outperforms the state-of-the-art neural English NER model on Japanese.
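The character-level tagging idea can be illustrated with a minimal sketch (not the authors' code; the segmentation, example sentence, and tags below are hypothetical). Each character is paired with the word that contains it, so a BIO tag sequence over characters can mark an entity that covers only part of a word, which a word-level tagger cannot do:

```python
def to_char_inputs(words):
    """Expand a word-segmented sentence into per-character inputs,
    pairing each character with its containing word, so a tagger can
    combine character and word information at every position."""
    return [(ch, w) for w in words for ch in w]

# Hypothetical example: "東京都に住む" ("lives in Tokyo Metropolis"),
# segmented into words. The entity 東京 (Tokyo, a LOCATION) is only
# part of the word 東京都, so a word-level tagger cannot extract it.
words = ["東京都", "に", "住む"]
chars = to_char_inputs(words)

# Character-level BIO tags can mark just the sub-word entity span:
tags = ["B-LOC", "I-LOC", "O", "O", "O", "O"]

for (ch, w), t in zip(chars, tags):
    print(ch, w, t)
```

In the paper's model, these per-character inputs would feed a bidirectional LSTM-CRF; this sketch only shows the input pairing and tag granularity.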
Misawa, S., Taniguchi, M., Miura, Y., & Ohkuma, T. (2017). Character-based bidirectional LSTM-CRF with words and characters for Japanese named entity recognition. In EMNLP 2017 - 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Proceedings of the Workshop (pp. 97–102). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-4114