In order to improve the performance of text classification and information retrieval in big data of electric power domain, we propose a novel Chinese language classification algorithm - De-word classification algorithm. Focusing on the key role played by the De-word in modern Chinese language, this algorithm examines Chinese text classification method from a unique angle. Besides, on the basis of traditional weighted algorithm, it designs a novel relevance weighting model - De-TFIDF, and achieves a higher correlation in text information retrieval. Experiments show that, De-word classification algorithm significantly improves the efficiency of text classification, significantly improved information retrieval performance.
CITATION STYLE
Guo, X., Sun, H., Wang, L., Qu, Z., & Ding, W. (2015). De-word classification algorithm based on the electric power of large data library retrieval. In Lecture Notes in Electrical Engineering (Vol. 330, pp. 590–602). Springer Verlag. https://doi.org/10.1007/978-3-662-45402-2_85
Mendeley helps you to discover research relevant for your work.