An Efficient Feature Selection Model for IGBO Text

Ifeanyi-Reuben Nkechi J; Benson-Emenike Mercy E

Journal ArticleOPEN ACCESS

An Efficient Feature Selection Model for IGBO Text

Nkechi J I
Mercy E B

International Journal of Data Mining & Knowledge Management Process (2018) 8(6) 19-33

DOI: 10.5121/ijdkp.2018.8602

N/ACitations

5Readers

Abstract

The development in Information Technology (IT) has encouraged the use of Igbo Language in text creation, online news reporting, online searching and articles publications. As the information stored in text format of this language is increasing, there is need for an intelligent text-based system for proper management of the data. The selection of optimal set of features for processing plays vital roles in text-based system. This paper analyzed the structure of Igbo text and designed an efficient feature selection model for an intelligent Igbo text-based system. It adopted Mean TF-IDF measure to select most relevant features on Igbo text documents represented with two word-based n-gram text representation (unigram and bigram) models. The model is designed with Object-Oriented Methodology and implemented with Python programming language with tools from Natural Language Toolkits (NLTK). The result shows that bigram represented text gives more relevant features based on the language semantics.

Cite

CITATION STYLE

APA

Nkechi J, I.-R., & Mercy E, B.-E. (2018). An Efficient Feature Selection Model for IGBO Text. International Journal of Data Mining & Knowledge Management Process, 8(6), 19–33. https://doi.org/10.5121/ijdkp.2018.8602

An Efficient Feature Selection Model for IGBO Text

Abstract

Cite

Register to see more suggestions