DM NLP at SemEval-2018 Task 8: Neural Sequence Labeling with Linguistic Features

9Citations
Citations of this article
70Readers
Mendeley users who have this article in their library.

Abstract

This paper describes our submissions for SemEval-2018 Task 8: Semantic Extraction from CybersecUrity REports using NLP. The DM NLP participated in two subtasks: SubTask 1 classifies if a sentence is useful for inferring malware actions and capabilities, and SubTask 2 predicts token labels (”Action”,”Entity”,”Modifier” and”Others”) for a given malware-related sentence. Since we leverage results of Subtask 2 directly to infer the result of Subtask 1, the paper focus on the system solving Subtask 2. By taking Subtask 2 as a sequence labeling task, our system relies on a recurrent neural network named BiLSTM-CNN-CRF with rich linguistic features, such as POS tags, dependency parsing labels, chunking labels, NER labels, Brown clustering. Our system achieved the highest F1 score in both token level and phrase level.

Cite

CITATION STYLE

APA

Ma, C., Zheng, H., Xie, P., Li, C., Li, L., & Luo, S. (2018). DM NLP at SemEval-2018 Task 8: Neural Sequence Labeling with Linguistic Features. In NAACL HLT 2018 - International Workshop on Semantic Evaluation, SemEval 2018 - Proceedings of the 12th Workshop (pp. 707–711). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s18-1114

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free