Adaptive selection of base classifiers in one-against-all learning for large multi-labeled collections

Arturo Montejo Ráez; Luís Alfonso Ureña López; Ralf Steinberger

Conference Proceedings

Adaptive selection of base classifiers in one-against-all learning for large multi-labeled collections

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 3230 1-12

DOI: 10.1007/978-3-540-30228-5_1

15Citations

13Readers

Get full text

Abstract

In this paper we present the problem found when studying an automated text categorization system for a collection of High Energy Physics (HEP) papers, which shows a very large number of possible classes (over 1,000) with highly imbalanced distribution. The collection is introduced to the scientific community and its imbalance is studied applying a new indicator: the inner imbalance degree. The one-against-all approach is used to perform multi-label assignment using Support Vector Machines. Over-weighting of positive samples and S-Cut thresholding is compared to an approach to automatically select a classifier for each class from a set of candidates. We also found that it is possible to reduce computational cost of the classification task by discarding classes for which classifiers cannot be trained successfully.

Cite

CITATION STYLE

APA

Ráez, A. M., López, L. A. U., & Steinberger, R. (2004). Adaptive selection of base classifiers in one-against-all learning for large multi-labeled collections. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 3230, pp. 1–12). Springer Verlag. https://doi.org/10.1007/978-3-540-30228-5_1

Adaptive selection of base classifiers in one-against-all learning for large multi-labeled collections

Abstract

Cite

Register to see more suggestions