ProtRank: Bypassing the imputation of missing values in differential expression analysis of proteomic data

6Citations
Citations of this article
64Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Background: Data from discovery proteomic and phosphoproteomic experiments typically include missing values that correspond to proteins that have not been identified in the analyzed sample. Replacing the missing values with random numbers, a process known as "imputation", avoids apparent infinite fold-change values. However, the procedure comes at a cost: Imputing a large number of missing values has the potential to significantly impact the results of the subsequent differential expression analysis. Results: We propose a method that identifies differentially expressed proteins by ranking their observed changes with respect to the changes observed for other proteins. Missing values are taken into account by this method directly, without the need to impute them. We illustrate the performance of the new method on two distinct datasets and show that it is robust to missing values and, at the same time, provides results that are otherwise similar to those obtained with edgeR which is a state-of-art differential expression analysis method. Conclusions: The new method for the differential expression analysis of proteomic data is available as an easy to use Python package.

Cite

CITATION STYLE

APA

Medo, M., Aebersold, D. M., & Medová, M. (2019). ProtRank: Bypassing the imputation of missing values in differential expression analysis of proteomic data. BMC Bioinformatics, 20(1). https://doi.org/10.1186/s12859-019-3144-3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free