Improving SVM text classification performance through threshold adjustment

James G. Shanahan; Norbert Roma

Conference Proceedings (Open Access)

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2003) 2837 361-372

DOI: 10.1007/978-3-540-39857-8_33

31 Citations

28 Readers

Abstract

In general, support vector machines (SVMs), when applied to text classification, provide excellent precision but poor recall. One means of customizing SVMs to improve recall is to adjust the threshold associated with an SVM. We describe an automatic process for adjusting the thresholds of generic SVMs which incorporates a user utility model, an integral part of an information management system. By using thresholds based on utility models and the ranking properties of classifiers, it is possible to overcome the precision bias of SVMs and ensure robust performance in recall across a wide variety of topics, even when training data are sparse. Evaluations on TREC data show that our proposed threshold-adjusting algorithm boosts the performance of baseline SVMs by at least 20% for standard information retrieval measures.
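
The general idea of utility-based thresholding can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It assumes a linear utility of the form gain × (true positives) − cost × (false positives), and the function name `best_utility_threshold`, the weights, and the toy scores are all hypothetical. It ranks documents by SVM decision score and picks the cut-off that maximizes utility on that ranking.

```python
from typing import Sequence, Tuple


def best_utility_threshold(
    scores: Sequence[float],
    labels: Sequence[int],
    gain: float = 2.0,   # assumed reward per relevant document accepted
    cost: float = 1.0,   # assumed penalty per non-relevant document accepted
) -> Tuple[float, float]:
    """Return (threshold, utility) maximizing gain*TP - cost*FP.

    A document is accepted when its score is >= threshold. If accepting
    nothing is best, the threshold is +inf and the utility is 0.
    """
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    best_threshold, best_utility = float("inf"), 0.0
    utility = 0.0
    i = 0
    while i < len(ranked):
        score = ranked[i][0]
        # Tied scores are accepted or rejected together.
        while i < len(ranked) and ranked[i][0] == score:
            utility += gain if ranked[i][1] else -cost
            i += 1
        if utility > best_utility:
            best_threshold, best_utility = score, utility
    return best_threshold, best_utility


# Toy decision scores (hypothetical) with 1 = relevant, 0 = not relevant.
scores = [1.3, 0.4, -0.2, -0.5, -0.9, -1.4]
labels = [1, 0, 1, 1, 0, 0]
threshold, utility = best_utility_threshold(scores, labels)
```

On this toy data the default SVM threshold of 0 accepts only one of the three relevant documents, while the utility-maximizing threshold falls below 0 and accepts all three at the price of one false positive. In practice the threshold would be selected on held-out or cross-validated scores rather than on the data used for evaluation.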

Cite

Shanahan, J. G., & Roma, N. (2003). Improving SVM text classification performance through threshold adjustment. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 2837, pp. 361–372). Springer Verlag. https://doi.org/10.1007/978-3-540-39857-8_33
