Evaluating network-based missing protein prediction using p -values, Bayes Factors, and probabilities

1Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Some prediction methods use probability to rank their predictions, while some other prediction methods do not rank their predictions and instead use p-values to support their predictions. This disparity renders direct cross-comparison of these two kinds of methods difficult. In particular, approaches such as the Bayes Factor upper Bound (BFB) for p-value conversion may not make correct assumptions for this kind of cross-comparisons. Here, using a well-established case study on renal cancer proteomics and in the context of missing protein prediction, we demonstrate how to compare these two kinds of prediction methods using two different strategies. The first strategy is based on false discovery rate (FDR) estimation, which does not make the same naïve assumptions as BFB conversions. The second strategy is a powerful approach which we colloquially call "home ground testing". Both strategies perform better than BFB conversions. Thus, we recommend comparing prediction methods by standardization to a common performance benchmark such as a global FDR. And where this is not possible, we recommend reciprocal "home ground testing".

Cite

CITATION STYLE

APA

Goh, W. W. B., Kong, W., & Wong, L. (2023). Evaluating network-based missing protein prediction using p -values, Bayes Factors, and probabilities. Journal of Bioinformatics and Computational Biology, 21(1). https://doi.org/10.1142/S0219720023500051

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free