Discovering New Sensitive Words Based on Sensitive Information Categorization

Panyu Liu; Yangyang Li; Zhiping Cai; Shuhui Chen

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019), Vol. 11632 LNCS, pp. 338–346

DOI: 10.1007/978-3-030-24274-9_30


Abstract

Sensitive word detection has attracted growing attention as internet technologies have flourished. At the same time, some internet users spread sensitive content that contains unhealthy information. Improving the accuracy of sensitive information classification and finding new sensitive words have therefore become urgent needs in network information security. On the one hand, existing sensitive information classification results are inaccurate; on the other hand, existing research methods cannot find new sensitive information, that is, they do not automatically identify it. We improve on an existing, well-performing machine learning classification algorithm, and experimental results show that this method significantly improves classification accuracy. In addition, by studying word similarity algorithms based on HowNet and CiLin, we can continually expand the database of sensitive words (i.e., discover new sensitive words). Using these methods, we achieve better accuracy and realize a new sensitive word discovery technique, both of which are analyzed and presented in the paper.
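The lexicon-expansion idea in the abstract can be illustrated with a minimal sketch. It assumes a seed set of known sensitive words, a stream of candidate words, and a pairwise similarity function with a cutoff. The names `expand_lexicon`, `char_jaccard`, and `threshold` are hypothetical, and the character-overlap similarity is only a stand-in so the example runs; it is not the HowNet/CiLin measure the paper uses, and the abstract does not specify the authors' actual procedure.

```python
from typing import Callable, Iterable, Set


def char_jaccard(a: str, b: str) -> float:
    """Placeholder similarity: Jaccard overlap of the two words' character sets.

    ASSUMPTION: this stands in for a HowNet/CiLin-based similarity, which
    requires external lexical resources not available in this sketch.
    """
    sa, sb = set(a), set(b)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def expand_lexicon(
    lexicon: Set[str],
    candidates: Iterable[str],
    similarity: Callable[[str, str], float] = char_jaccard,
    threshold: float = 0.6,  # hypothetical cutoff, not taken from the paper
) -> Set[str]:
    """Return the seed lexicon plus every candidate similar enough to a seed word."""
    expanded = set(lexicon)
    for word in candidates:
        if word in expanded:
            continue
        # Compare only against the original seed words, so one borderline
        # addition cannot pull in further words transitively.
        if any(similarity(word, seed) >= threshold for seed in lexicon):
            expanded.add(word)
    return expanded


if __name__ == "__main__":
    seeds = {"gamble"}
    print(sorted(expand_lexicon(seeds, ["gambler", "garden"])))
```

Passing a different `similarity` callable (for example, one backed by a semantic dictionary) leaves the expansion loop unchanged, which is the main reason the similarity function is kept as a parameter here.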

Cite

Liu, P., Li, Y., Cai, Z., & Chen, S. (2019). Discovering New Sensitive Words Based on Sensitive Information Categorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11632 LNCS, pp. 338–346). Springer Verlag. https://doi.org/10.1007/978-3-030-24274-9_30
