Avoiding C-hacking when evaluating survival distribution predictions with discrimination measures

9Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Motivation: In this article, we consider how to evaluate survival distribution predictions with measures of discrimination. This is non-Trivial as discrimination measures are the most commonly used in survival analysis and yet there is no clear method to derive a risk prediction from a distribution prediction. We survey methods proposed in literature and software and consider their respective advantages and disadvantages. Results: Whilst distributions are frequently evaluated by discrimination measures, we find that the method for doing so is rarely described in the literature and often leads to unfair comparisons or 'C-hacking'. We demonstrate by example how simple it can be to manipulate results and use this to argue for better reporting guidelines and transparency in the literature. We recommend that machine learning survival analysis software implements clear transformations between distribution and risk predictions in order to allow more transparent and accessible model evaluation.

References Powered by Scopus

Multivariable prognostic models: Issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors

8135Citations
N/AReaders
Get full text

Evaluating the Yield of Medical Tests

2724Citations
N/AReaders
Get full text

Applied Survival Analysis: Regression Modeling of Time to Event Data: Second Edition

2183Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Deep learning for survival analysis: a review

26Citations
N/AReaders
Get full text

Applied Machine Learning Using mlr3 in R

12Citations
N/AReaders
Get full text

Explaining the optimistic performance evaluation of newly proposed methods: A cross-design validation experiment

10Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Sonabend, R., Bender, A., & Vollmer, S. (2022). Avoiding C-hacking when evaluating survival distribution predictions with discrimination measures. Bioinformatics, 38(17), 4178–4184. https://doi.org/10.1093/bioinformatics/btac451

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 7

78%

Researcher 2

22%

Readers' Discipline

Tooltip

Nursing and Health Professions 6

50%

Computer Science 2

17%

Biochemistry, Genetics and Molecular Bi... 2

17%

Energy 2

17%

Save time finding and organizing research with Mendeley

Sign up for free