Quantifying the significance and relevance of cyber-security text through textual similarity and cyber-security knowledge graph

This article is free to access.

Abstract

To proactively mitigate cyber-security risks, security analysts have to continuously monitor sources of threat information. However, the sheer amount of textual information that needs to be processed is overwhelming, and separating the threats from the noise requires a great deal of mundane labor. We propose a novel approach to quantify the relevance and significance of cyber-security text. We trained a custom Named Entity Recognition (NER) model and constructed a Cyber-security Knowledge Graph (CKG) to infer the subjective relevance of the text to the user and to generate correlation features. In addition, we analyzed the significance of a given text by computing its maximum textual similarity with each of several repositories of pre-defined "significant" text. These analysis results then serve as features for a classifier that generates the significance score. The experimental results showed that the overall system could determine the significance and relevance of text within a controlled environment with 88% accuracy.
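The maximum-similarity step can be illustrated with a minimal sketch. The abstract does not state which similarity measure, text representation, or repositories the authors used, so everything here is an assumption made for illustration: bag-of-words cosine similarity stands in for the similarity measure, and the function names (`tokenize`, `cosine_similarity`, `max_similarity_features`) and repository names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between the bag-of-words vectors of two texts.

    Assumption: term counts stand in for whatever representation
    the paper actually uses.
    """
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def max_similarity_features(text, repositories):
    """Return one feature per repository: the highest similarity between
    `text` and any document in that repository (0.0 if it is empty)."""
    return {
        name: max((cosine_similarity(text, doc) for doc in docs), default=0.0)
        for name, docs in repositories.items()
    }


# Hypothetical repositories of pre-defined "significant" text.
repositories = {
    "advisories": [
        "remote code execution in web server",
        "buffer overflow in image parser",
    ],
    "exploits": ["sql injection proof of concept"],
}
features = max_similarity_features(
    "new buffer overflow found in image parser", repositories
)
```

In a pipeline like the one the abstract describes, a feature dictionary of this kind would be combined with the knowledge-graph correlation features and passed to a classifier; that classifier is not sketched here.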

Cite

Mendsaikhan, O., Hasegawa, H., Yamaguchi, Y., & Shimada, H. (2020). Quantifying the significance and relevance of cyber-security text through textual similarity and cyber-security knowledge graph. IEEE Access, 8, 177041–177052. https://doi.org/10.1109/ACCESS.2020.3027321
