Text categorization is a task of text mining/analytics which involves extracting useful information from unstructured resources followed by categorizing these documents. In this paper, we classify the TechTC dataset collected from various Web directories. We employed feature selection methods such as Gini index, chi-square, t-statistic, correlation which drastically reduced the model building time. Various neural network models such as probabilistic neural network, group method of data handling, multi layer perceptron yielded higher accuracies compared to other techniques applied in literature.
CITATION STYLE
Shravankumar, B., & Ravi, V. (2015). Text classification using ensemble features selection and data mining techniques. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8947, pp. 176–186). Springer Verlag. https://doi.org/10.1007/978-3-319-20294-5_16
Mendeley helps you to discover research relevant for your work.