Research on the semantic argument classification requires semantically labeled data in large numbers, called corpus. Because building a corpus is costly and time-consuming, recently many studies have used existing corpus as the training data to conduct semantic argument classification research on new domain. But previous studies have proven that there is a significant decrease in performance when classifying semantic arguments on different domain between the training and the testing data. The main problem is when there is a new argument that found in the testing data but it is not found in the training data. This research carries on semantic argument classification on a new domain that is Quran English Translation by utilizing Propbank corpus as the training data. To recognize the new argument in the training data, this research proposes four new features for extending the argument features in the training data. By using SVM Linear, the experiment has proven that augmenting the proposed features to the baseline system with some combinations option improve the performance of semantic argument classification on Quran data using Propbank Corpus as training data.
CITATION STYLE
Batubara, D. K., Bijaksana, M. A., & Adiwijaya. (2018). On feature augmentation for semantic argument classification of the Quran English translation using support vector machine. In Journal of Physics: Conference Series (Vol. 971). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/971/1/012043
Mendeley helps you to discover research relevant for your work.