An iterative hadoop-based ensemble data classification model on distributed medical databases

5Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

As the size and complexity of the online biomedical databases are growing day by day, finding an essential structure or unstructured patterns in the distributed biomedical applications has become more complex. Traditional Hadoop-based distributed decision tree models such as Probability based decision tree (PDT), Classification And Regression Tree (CART) and Multiclass Classification Decision Tree have failed to discover relational patterns, user-specific patterns and feature-based patterns, due to the large number of feature sets. These models depend on selection of relevant attributes and uniform data distribution. Data imbalance, indexing and sparsity are the three major issues in these distributed decision tree models. In this proposed model, an enhanced attributes selection ranking model and Hadoop-based decision tree model were implemented to extract the user-specific interesting patterns in online biomedical databases. Experimental results show that the proposed model has high true positive, high precision and low error rate compared to traditional distributed decision tree models.

Cite

CITATION STYLE

APA

Bikku, T., Nandam, S. R., & Akepogu, A. R. (2017). An iterative hadoop-based ensemble data classification model on distributed medical databases. In Advances in Intelligent Systems and Computing (Vol. 507, pp. 341–351). Springer Verlag. https://doi.org/10.1007/978-981-10-2471-9_33

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free