Abstract
Unsupervised features based on word representations, such as word embeddings and word collocations, have been shown to significantly improve supervised NER for English. In this work we investigate whether such unsupervised features can also boost supervised NER in Spanish. To do so, we use word representations and collocations as additional features in a linear-chain Conditional Random Field (CRF) classifier. Experimental results (82.44% F-score on the CoNLL-2002 corpus) show that our approach is comparable to some state-of-the-art Deep Learning approaches for Spanish, in particular when using cross-lingual word representations.
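As a rough illustration of what "word representations as additional features" can look like in practice, the following minimal sketch builds per-token feature dictionaries that combine ordinary surface features with a cluster identifier derived from an unsupervised word representation. The `TOY_CLUSTERS` lookup, the feature names, and the function names are all hypothetical and are not taken from the paper; in a real setting the cluster IDs would come from a representation induced on unlabeled text, and the resulting dictionaries would be passed to a linear-chain CRF toolkit.

```python
from typing import Dict, List

# Hypothetical toy lookup standing in for an unsupervised word representation
# (e.g. cluster IDs induced from unlabeled text). Values are made up.
TOY_CLUSTERS: Dict[str, str] = {
    "madrid": "c17",
    "barcelona": "c17",
    "vive": "c8",
    "en": "c3",
}


def token_features(sentence: List[str], i: int,
                   clusters: Dict[str, str]) -> Dict[str, object]:
    """Build a feature dict for token i: surface features plus cluster features."""
    word = sentence[i]
    feats: Dict[str, object] = {
        "bias": 1.0,
        "word.lower": word.lower(),
        "word.istitle": word.istitle(),
        "word.isdigit": word.isdigit(),
        "suffix3": word[-3:],
        # Unsupervised feature: cluster ID of the current word, "UNK" if unseen.
        "cluster": clusters.get(word.lower(), "UNK"),
    }
    if i > 0:
        prev = sentence[i - 1]
        feats["prev.lower"] = prev.lower()
        feats["prev.cluster"] = clusters.get(prev.lower(), "UNK")
    else:
        feats["BOS"] = True  # beginning of sentence
    if i < len(sentence) - 1:
        nxt = sentence[i + 1]
        feats["next.lower"] = nxt.lower()
        feats["next.cluster"] = clusters.get(nxt.lower(), "UNK")
    else:
        feats["EOS"] = True  # end of sentence
    return feats


def sentence_features(sentence: List[str],
                      clusters: Dict[str, str]) -> List[Dict[str, object]]:
    """Feature dicts for every token in a sentence, in order."""
    return [token_features(sentence, i, clusters) for i in range(len(sentence))]


if __name__ == "__main__":
    for feats in sentence_features(["Ana", "vive", "en", "Madrid"], TOY_CLUSTERS):
        print(feats)
```

The point of the cluster feature is that a word never seen with a label in the training data (say, "Barcelona") can still share a feature value with a labeled word ("Madrid"), which is one plausible way unlabeled text can help a supervised tagger.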
Copara, J., Ochoa, J., Thorne, C., & Glavaš, G. (2016). Conditional random fields for Spanish named entity recognition using unsupervised features. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10022 LNAI, pp. 175–186). Springer Verlag. https://doi.org/10.1007/978-3-319-47955-2_15