Boosting first-order clauses for large, skewed data sets

Louis Oliphant; Elizabeth Burnside; Jude Shavlik

Conference Proceedings

Boosting first-order clauses for large, skewed data sets

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 5989 LNAI 166-177

DOI: 10.1007/978-3-642-13840-9_15

0Citations

3Readers

Get full text

Abstract

Creating an effective ensemble of clauses for large, skewed data sets requires finding a diverse, high-scoring set of clauses and then combining them in such a way as to maximize predictive performance. We have adapted the RankBoost algorithm in order to maximize area under the recall-precision curve, a much better metric when working with highly skewed data sets than ROC curves. We have also explored a range of possibilities for the weak hypotheses used by our modified RankBoost algorithm beyond using individual clauses. We provide results on four large, skewed data sets showing that our modified RankBoost algorithm outperforms the original on area under the recall-precision curves. © 2010 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Oliphant, L., Burnside, E., & Shavlik, J. (2010). Boosting first-order clauses for large, skewed data sets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5989 LNAI, pp. 166–177). https://doi.org/10.1007/978-3-642-13840-9_15

Boosting first-order clauses for large, skewed data sets

Abstract

Author supplied keywords

Cite

Register to see more suggestions