THU_NGN at SemEval-2019 task 12: Toponym detection and disambiguation on scientific papers

2Citations
Citations of this article
73Readers
Mendeley users who have this article in their library.

Abstract

Toponym resolution is an important and challenging task in the neural language processing field, and has wide applications such as emergency response and social media geographical event analysis. Toponym resolution can be roughly divided into two independent steps, i.e., toponym detection and toponym disambiguation. In order to facilitate the study on toponym resolution, the SemEval 2019 task 12 is proposed, which contains three subtasks, i.e., toponym detection, toponym disambiguation and toponym resolution. In this paper, we introduce our system that participated in the SemEval 2019 task 12. For toponym detection, in our approach we use TagLM as the basic model, and explore the use of various features in this task, such as word embeddings extracted from pre-trained language models, POS tags and lexical features extracted from dictionaries. For toponym disambiguation, we propose a heuristics rule-based method using toponym frequency and population. Our systems achieved 83.03% strict macro F1, 74.50 strict micro F1, 85.92 overlap macro F1 and 78.47 overlap micro F1 in toponym detection subtask.

Cite

CITATION STYLE

APA

Qi, T., Ge, S., Wu, C., Chen, Y., & Huang, Y. (2019). THU_NGN at SemEval-2019 task 12: Toponym detection and disambiguation on scientific papers. In NAACL HLT 2019 - International Workshop on Semantic Evaluation, SemEval 2019, Proceedings of the 13th Workshop (pp. 1302–1307). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s19-2229

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free