Even though Support Vector Machines (SVMs) are capable of identifying patterns in high dimensional kernel spaces, their performance is determined by two main factors: SVM cost parameter and kernel parameters. This paper identifies a mechanism to extract meta features from string datasets, and derives a n-gram string kernel SVM optimization method. In the method, a meta model is trained over computed string meta-features for each dataset from a string dataset pool, learning algorithm parameters, and accuracy information to predict the optimal parameter combination for a given string classification task. In the experiments, the n-gram SVM were optimized using the proposed algorithm over four string datasets: spam, Reuters-21578, Network Application Detection and e-News Categorization. The experiment results revealed that the proposed algorithm was able to produce parameter combinations which yield good string classification accuracies for n-gram SVM on all string datasets. © 2010 Springer-Verlag.
CITATION STYLE
Gunasekara, N., Pang, S., & Kasabov, N. (2010). Tuning N-gram string kernel SVMs via meta learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6444 LNCS, pp. 91–98). https://doi.org/10.1007/978-3-642-17534-3_12
Mendeley helps you to discover research relevant for your work.