ProtRank: Bypassing the imputation of missing values in differential expression analysis of proteomic data

Matúš Medo; Daniel M. Aebersold; Michaela Medová

Journal Article (Open Access)

BMC Bioinformatics (2019) 20(1)

DOI: 10.1186/s12859-019-3144-3

Abstract

Background: Data from discovery proteomic and phosphoproteomic experiments typically include missing values that correspond to proteins that have not been identified in the analyzed sample. Replacing the missing values with random numbers, a process known as "imputation", avoids apparent infinite fold-change values. However, the procedure comes at a cost: imputing a large number of missing values has the potential to significantly impact the results of the subsequent differential expression analysis.

Results: We propose a method that identifies differentially expressed proteins by ranking their observed changes with respect to the changes observed for other proteins. Missing values are taken into account by this method directly, without the need to impute them. We illustrate the performance of the new method on two distinct datasets and show that it is robust to missing values and, at the same time, provides results that are otherwise similar to those obtained with edgeR, which is a state-of-the-art differential expression analysis method.

Conclusions: The new method for the differential expression analysis of proteomic data is available as an easy-to-use Python package.
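The ranking idea in the abstract can be illustrated with a minimal sketch. It is not the ProtRank package's API or its exact scoring rule. The function name `rank_changes`, the use of `None` for a missing value, and the convention that an appearing protein ranks above every finite increase (and a disappearing one below every finite decrease) are all assumptions made for illustration.

```python
import math


def rank_changes(before, after):
    """Score each protein's change between two samples on a scale from -1 to 1.

    Hypothetical helper, not part of ProtRank. Missing values are encoded as
    None and are never replaced by imputed numbers.
    """
    keys = []
    for b, a in zip(before, after):
        if b is None and a is None:
            keys.append(None)        # comparison carries no information
        elif b is None:
            keys.append(math.inf)    # assumed: ranks above any finite increase
        elif a is None:
            keys.append(-math.inf)   # assumed: ranks below any finite decrease
        else:
            keys.append(math.log2(a / b))  # ordinary log fold change

    informative = [k for k in keys if k is not None]
    n = len(informative)

    scores = []
    for k in keys:
        if k is None or n < 2:
            scores.append(0.0)
            continue
        # Position of this change relative to all other observed changes;
        # ties contribute to neither count.
        below = sum(1 for other in informative if other < k)
        above = sum(1 for other in informative if other > k)
        scores.append((below - above) / (n - 1))
    return scores


# Five proteins measured in two samples; None marks a missing value.
before = [100.0, 200.0, None, 50.0, None]
after = [200.0, 100.0, 80.0, None, None]
scores = rank_changes(before, after)
```

Because only the ordering of changes matters in this sketch, no infinite fold-change value enters any arithmetic, which is the problem imputation is usually meant to work around.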

Cite

Medo, M., Aebersold, D. M., & Medová, M. (2019). ProtRank: Bypassing the imputation of missing values in differential expression analysis of proteomic data. BMC Bioinformatics, 20(1). https://doi.org/10.1186/s12859-019-3144-3
