Random Forests for Premise Selection

Abstract

The success rates of automated theorem provers (ATPs) in large theories depend heavily on the choice of given facts. Premise selection is the task of choosing the subset of available facts that is most likely to lead to a successful automated proof of a given conjecture. Premise selection can be viewed as a multi-label classification problem, for which machine learning from related proofs is currently the most successful approach. Random forests are a machine learning technique known to perform especially well on large datasets. In this paper, we evaluate random forest algorithms for premise selection. To deal with the specifics of automated reasoning, we propose a number of extensions to random forests, such as incremental learning, multi-path querying, depth weighting, feature IDF (inverse document frequency) weighting, and the integration of secondary classifiers in the tree leaves. With these extensions, we improve on the k-nearest neighbour algorithm in terms of both prediction quality and ATP performance.
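As a rough illustration of two ingredients named in the abstract, the Python sketch below combines IDF feature weighting with the k-nearest-neighbour baseline that the random forests are compared against. The data layout (fact_features mapping fact names to their symbol features, proofs mapping theorem names to the premises used in their proofs) and the parameters k and n_best are illustrative assumptions, not the authors' implementation.

```python
import math
from collections import Counter, defaultdict

def idf_weights(fact_features):
    """IDF weighting: symbols that occur in few facts are more informative."""
    n = len(fact_features)
    df = Counter(f for feats in fact_features.values() for f in set(feats))
    return {f: math.log(n / df[f]) for f in df}

def knn_premises(conjecture_feats, fact_features, proofs, idf, k=40, n_best=64):
    """Rank facts by IDF-weighted feature overlap with the conjecture,
    then let the k nearest facts vote for the premises of their proofs."""
    conj = set(conjecture_feats)
    sims = {name: sum(idf.get(f, 0.0) for f in conj & set(feats))
            for name, feats in fact_features.items()}
    nearest = sorted(sims, key=sims.get, reverse=True)[:k]
    votes = defaultdict(float)
    for name in nearest:
        votes[name] += sims[name]              # the neighbour itself is a candidate premise
        for premise in proofs.get(name, ()):   # ...as are the premises used in its proof
            votes[premise] += sims[name]
    return sorted(votes, key=votes.get, reverse=True)[:n_best]
```

A random-forest selector replaces this flat similarity ranking with votes aggregated over many decision trees; the paper's extensions (depth weighting, multi-path querying, secondary classifiers in the leaves) refine how those votes are collected.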

Citation (APA)

Färber, M., & Kaliszyk, C. (2015). Random forests for premise selection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9322, pp. 325–340). Springer Verlag. https://doi.org/10.1007/978-3-319-24246-0_20
