Abstract
Term weighting is a preprocessing phase that has an important role in the text classification by giving the appropriate weight for each term in all documents. In previous research, many supervised term weighting methods have been introduced, but most of the supervised term weighting only considers the distribution of terms in the two classes so that it is not optimal for the multi-class classification. This paper introduces a new supervised weighting with association concept to optimize term weighting distributions in multi-class cases by considering terms that exist in each class and paying attention to the number of terms in the document belonging to the class, also considering the relationship pattern between one or more items with association concept in a dataset to measure the strength of terms in a class by using confidence values. The dataset used are the data twitter taken from the PR FM twitter account. The proposed supervised term weighting method implemented with SVM classifier can outperform unsupervised weighting schemes such as TF-IDF with the average accuracy 81.704%.
Cite
CITATION STYLE
Izzah, I. K., & Girsang, A. S. (2020). Association on Supervised Term Weighting Method for Classification on Data Twitter. International Journal of Recent Technology and Engineering (IJRTE), 8(6), 859–863. https://doi.org/10.35940/ijrte.f6975.038620
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.