Choosing learning algorithms using sign tests with high replicability

0Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

An important task in machine learning is determining which learning algorithm works best for a given data set. When the amount of data is small the same data needs to be used repeatedly in order to get a reasonable estimate of the accuracy of the learning algorithms. This results in violations of assumptions on which standard tests are based and makes it hard to design a good test. In this article, we investigate sign tests to address the problem of choosing the best of two learning algorithms when only a small data set is available. Sign tests are conceptually simple and no assumption about underlying distributions is required.We showtha t simplistic sample generation can lead to flawed test outcomes. Furthermore, we identify a test that performs well based on Type I error (showing a difference between algorithms when there is none), power (showing a difference when it indeed exists) and replicability. Replicability is a novel measure of a quality of a test that gives an indication how likely it is that the test outcome will be the same when the same test on the same data with the same sampling scheme and same pair of algorithms is executed, but with a different randomization of the data. A newd efinition of replicability is provided and its benefits highlighted. Empirical evidence is provided to showthe test is robust under a varied range of circumstances.

Cite

CITATION STYLE

APA

Bouckaert, R. R. (2003). Choosing learning algorithms using sign tests with high replicability. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2903, pp. 710–722). Springer Verlag. https://doi.org/10.1007/978-3-540-24581-0_61

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free