Metabolite-disease association prediction algorithm combining DeepWalk and random forest

26Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.

Abstract

Identifying the association between metabolites and diseases will help us understand the pathogenesis of diseases, which has great significance in diagnosing and treating diseases. However, traditional biometric methods are time consuming and expensive. Accordingly, we propose a new metabolite-disease association prediction algorithm based on DeepWalk and random forest (DWRF), which consists of the following key steps: First, the semantic similarity and information entropy similarity of diseases are integrated as the final disease similarity. Similarly, molecular fingerprint similarity and information entropy similarity of metabolites are integrated as the final metabolite similarity. Then, DeepWalk is used to extract metabolite features based on the network of metabolite-gene associations. Finally, a random forest algorithm is employed to infer metabolite-disease associations. The experimental results show that DWRF has good performances in terms of the area under the curve value, leave-one-out cross-validation, and five-fold cross-validation. Case studies also indicate that DWRF has a reliable performance in metabolite-disease association prediction.

Cite

CITATION STYLE

APA

Tie, J., Lei, X., & Pan, Y. (2022). Metabolite-disease association prediction algorithm combining DeepWalk and random forest. Tsinghua Science and Technology, 27(1), 58–67. https://doi.org/10.26599/TST.2021.9010003

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free