The development in Information Technology (IT) has encouraged the use of Igbo Language in text creation, online news reporting, online searching and articles publications. As the information stored in text format of this language is increasing, there is need for an intelligent text-based system for proper management of the data. The selection of optimal set of features for processing plays vital roles in text-based system. This paper analyzed the structure of Igbo text and designed an efficient feature selection model for an intelligent Igbo text-based system. It adopted Mean TF-IDF measure to select most relevant features on Igbo text documents represented with two word-based n-gram text representation (unigram and bigram) models. The model is designed with Object-Oriented Methodology and implemented with Python programming language with tools from Natural Language Toolkits (NLTK). The result shows that bigram represented text gives more relevant features based on the language semantics.
CITATION STYLE
Nkechi J, I.-R., & Mercy E, B.-E. (2018). An Efficient Feature Selection Model for IGBO Text. International Journal of Data Mining & Knowledge Management Process, 8(6), 19–33. https://doi.org/10.5121/ijdkp.2018.8602
Mendeley helps you to discover research relevant for your work.