Cutpointr: Improved estimation and validation of optimal cutpoints in r

298Citations
Citations of this article
167Readers
Mendeley users who have this article in their library.
Get full text

Abstract

“Optimal cutpoints” for binary classification tasks are often established by testing which cutpoint yields the best discrimination, for example the Youden index, in a specific sample. This results in “optimal” cutpoints that are highly variable and systematically overestimate the out-of-sample performance. To address these concerns, the cutpointr package offers robust methods for estimating optimal cutpoints and the out-of-sample performance. The robust methods include bootstrapping and smoothing based on kernel estimation, generalized additive models, smoothing splines, and local regression. These methods can be applied to a wide range of binary-classification and cost-based metrics. cutpointr also provides mechanisms to utilize user-defined metrics and estimation methods. The package has capabilities for parallelization of the bootstrapping, including reproducible random number generation. Furthermore, it is pipe-friendly, for example for compatibility with functions from tidyverse. Various functions for plotting receiver operating characteristic curves, precision recall graphs, bootstrap results and other representations of the data are included. The package contains example data from a study on psychological characteristics and suicide attempts suitable for applying binary classification algorithms.

Author supplied keywords

Cite

CITATION STYLE

APA

Thiele, C., & Hirschfeld, G. (2021). Cutpointr: Improved estimation and validation of optimal cutpoints in r. Journal of Statistical Software, 98. https://doi.org/10.18637/jss.v098.i11

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free