Quantifying the significance and relevance of cyber-security text through textual similarity and cyber-security knowledge graph

Otgonpurev Mendsaikhan; Hirokazu Hasegawa; Yukiko Yamaguchi; Hajime Shimada

Journal ArticleOPEN ACCESS

Quantifying the significance and relevance of cyber-security text through textual similarity and cyber-security knowledge graph

IEEE Access (2020) 8 177041-177052

DOI: 10.1109/ACCESS.2020.3027321

23Citations

59Readers

Abstract

In order to proactively mitigate cyber-security risks, security analysts have to continuously monitor sources of threat information. However, the sheer amount of textual information that needs to be processed is overwhelming, and it requires a great deal of mundane labor to separate the threats from the noise. We propose a novel approach to represent the relevance and significance of the cyber-security text in quantitative numbers. We trained custom Named Entity Recognition (NER) model and constructed a Cyber-security Knowledge Graph (CKG) to infer the subjective relevance of the cyber-security text to the user and to generate correlation features. In addition, the significance of the given text was analyzed in terms of its textual similarity with different repositories of pre-defined ‘‘significant’’ text and the maximum similarities were computed. These analysis results then act as features of the classifier to generate the significance score. The experimental result showed that the overall system could determine the significance and relevance of the text within a controlled environment with 88% accuracy.

Author supplied keywords

Cite

CITATION STYLE

APA

Mendsaikhan, O., Hasegawa, H., Yamaguchi, Y., & Shimada, H. (2020). Quantifying the significance and relevance of cyber-security text through textual similarity and cyber-security knowledge graph. IEEE Access, 8, 177041–177052. https://doi.org/10.1109/ACCESS.2020.3027321

Quantifying the significance and relevance of cyber-security text through textual similarity and cyber-security knowledge graph

Abstract

Author supplied keywords

Cite

Register to see more suggestions