The use of textual data has increased exponentially in recent years due to the networking infrastructure such as Facebook, Twitter, Wikipedia, Blogs, and so one. Analysis of this massive textual data can help to automatically categorize and label new content. Before classification process, term weighting scheme is the crucial step for representing the documents in a way suitable for classification algorithms. In this paper, we are conducting a survey on the term weighting schemes and we propose an efficient term weighting scheme that provide a better classification accuracy than those obtening with the famous TF-IDF, the recent IF-IGM and the others term weighting schemes in the literature.
CITATION STYLE
Kandé, D., Camara, F., Marone, R. M., & Ndiaye, S. (2019). Vector space model of text classification based on inertia contribution of document. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (Vol. 260, pp. 155–165). Springer Verlag. https://doi.org/10.1007/978-3-030-05198-3_14
Mendeley helps you to discover research relevant for your work.