A MapReduce-Based Parallel Random Forest Approach for Predicting Large-Scale Protein-Protein Interactions

2Citations
Citations of this article
1Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The protein-protein interactions (PPIs) play an important part in understanding cellular mechanisms. Recently, a number of computational approaches for predicting PPIs have been proposed. However, most of the existing methods are only suitable for relatively small-scale PPIs prediction. In this study, we propose a MapReduce-based parallel Random Forest model for predicting large-scale PPIs using only proteins sequence information. More specifically, the Moran autocorrelation descriptor is firstly used to extract the local features from protein sequence. Then, the MapReduce-based parallel Random Forest model is utilized to perform PPIs prediction. In the experiment, the proposed method greatly reduces the required time to train the model, while maintaining the high accuracy in the prediction of potential PPIs. The promising results demonstrate that our method can be used as an efficient tool in the field of large-scale PPIs prediction, which greatly reduces the required training time and has high prediction accuracy.

Cite

CITATION STYLE

APA

Ji, B. Y., You, Z. H., Yang, L., Zhou, J. R., & Hu, P. W. (2020). A MapReduce-Based Parallel Random Forest Approach for Predicting Large-Scale Protein-Protein Interactions. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12465 LNAI, pp. 400–407). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-60796-8_34

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free