pvclass: An R package for p values for classification

1Citations
Citations of this article
15Readers
Mendeley users who have this article in their library.

Abstract

Let (X, Y) be a random variable consisting of an observed feature vector X and an unobserved class label Y ∈ {1, 2,…, L} with unknown joint distribution. In addition, let D be a training data set consisting of n completely observed independent copies of (X, Y). Instead of providing point predictors (classifiers) for Y, we compute for each b ∈ {1, 2,…, L} a p value πb(X, D) for the null hypothesis that Y = b, treating Y temporarily as a fixed parameter, i.e., we construct a prediction region for Y with a certain confidence. The advantages of this approach over more traditional ones are reviewed briefly. In principle, any reasonable classifier can be modified to yield nonparametric p values. We describe the R package pvclass which computes nonparametric p values for the potential class memberships of new observations as well as cross-validated p values for the training data. Additionally, it provides graphical displays and quantitative analyses of the p values.

Cite

CITATION STYLE

APA

Zumbrunnen, N., & Dümbgen, L. (2017). pvclass: An R package for p values for classification. Journal of Statistical Software, 78. https://doi.org/10.18637/jss.v078.i04

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free