Evaluation of Cosine Similarity Feature for Named Entity Recognition on Tweets

3Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper, we present the Named Entity Recognition as a multi-class system and we evaluate baseline classifiers along with the technical features extracted from tweet datasets. Initially, we elaborate the conversion procedure of tweet data and we study three different datasets such that raw tweet data are compatible to the presented data model. The first dataset is well-known for benchmarking purposes and the other two datasets have been collected in the wild by using Twitter search API and given keywords. Then, we elaborate the feature vector constituted by 9 technical features. To reach at higher statistical metric values of the multi-class NER system, we seek the performance of the classifier subject to different combination of features. Finally, we elaborate the impact of the cosine similarity to the class centroid feature to the performance of the classifiers and we present the highest F1 score reached by using a particular set of features.

Cite

CITATION STYLE

APA

Büyüktopaç, O., & Acarman, T. (2020). Evaluation of Cosine Similarity Feature for Named Entity Recognition on Tweets. In Advances in Intelligent Systems and Computing (Vol. 1061, pp. 125–135). Springer. https://doi.org/10.1007/978-3-030-31964-9_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free