Supervised machine learning for multi-label classification of bangla articles

Dip Bhakta; Avimonnu Arnob Dash; Md Faisal Bari; Swakkhar Shatabda

Conference Proceedings

Supervised machine learning for multi-label classification of bangla articles

Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (2020) 325 LNICST 477-487

DOI: 10.1007/978-3-030-52856-0_38

2Citations

1Readers

Get full text

Abstract

Multi-label text classification has been a key point of research in the area of text classification latterly. But to the best of our knowledge, there have been very few research on multi-label text classification for Bangla text. There is also inadequacy of proper dataset for multi-label classification on Bangla text. Multi-label classification has many applications in the real world. One of them is automated labeling of articles of online news portals so that readers can easily look up other news articles on similar topics by clicking on hyperlinks. We applied supervised multi-label classification techniques on Bangla news articles for automated tag generation to predict related topics. We have built a new dataset from scratch and applied various problem transformation methods for multi-label classification with naive bayes classifier, logistic regression and SVM. We have analyzed the performance of these algorithms on Bangla news articles with precision, recall, f1-score and hamming loss. The dataset and the analysis of the results can be valuable for further research on multi-label text classification of Bangla text. We have open-sourced the dataset and the source code of this work (http://bit.ly/34cSNCR).

Author supplied keywords

Cite

CITATION STYLE

APA

Bhakta, D., Dash, A. A., Bari, M. F., & Shatabda, S. (2020). Supervised machine learning for multi-label classification of bangla articles. In Lecture Notes of the Institute for Computer Sciences, Social-Informatics and Telecommunications Engineering, LNICST (Vol. 325 LNICST, pp. 477–487). Springer. https://doi.org/10.1007/978-3-030-52856-0_38

Supervised machine learning for multi-label classification of bangla articles

Abstract

Author supplied keywords

Cite

Register to see more suggestions