Discovering New Sensitive Words Based on Sensitive Information Categorization

0Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Sensitive word detection has popped out nowadays as the prosperity of internet technologies emerges. At the same time, some internet users diffuse sensitive contents which contains unhealthy information. But how to improve sensitive information classification accuracy and find new sensitive words has been an urgent demand in the network information security. On the one hand, the sensitive information classification result inaccurate, on the other hand, all the research methods can not find the new sensitive information, in other word, it does not automatically identify new sensitive information. We mainly improved the existing outstanding machine learning classification algorithm, experimental results show that this method can significantly improve the classification accuracy. Beside, by researching word similarity algorithm base on HowNet and CiLin, we can realize expanding the database of sensitive words continually (i.e., discovery the new sensitive word). Through the methodologies mentioned above, we have got a better accuracy and realized new sensitive word discovery technology which will be analyzed and presented in the paper.

Cite

CITATION STYLE

APA

Liu, P., Li, Y., Cai, Z., & Chen, S. (2019). Discovering New Sensitive Words Based on Sensitive Information Categorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11632 LNCS, pp. 338–346). Springer Verlag. https://doi.org/10.1007/978-3-030-24274-9_30

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free