Disease gene identification is of great significance for the treatment of genetic disorders. In recent years, the rapid development of high-throughput sequencing technologies has brought great revolution for disease gene identification methods. Network-based methods are now the most efficient component for disease gene identification, while the most of current methods pay only attention to the local topological attributes regardless of the global distribution. In this paper, we proposed to apply the random walk algorithm to extract global features for each gene and finally used binary logistic regression model to identify whether a gene belongs to the given disease. We also integrate the local features and global features into a complex feature vector to improve the identification performance. The experimental results show that the global feature is of great efficiency for disease gene identification. We organize the global feature into different kinds of feature vectors and we can get higher AUC scores than other state-of-the-art methods for all these feature vectors.
CITATION STYLE
Wei, L., Wu, S., Zhang, J., & Xu, Y. (2016). Random walk based global feature for disease gene identification. In Communications in Computer and Information Science (Vol. 663, pp. 464–473). Springer Verlag. https://doi.org/10.1007/978-981-10-3005-5_38
Mendeley helps you to discover research relevant for your work.