An iterative hadoop-based ensemble data classification model on distributed medical databases

Thulasi Bikku; Sambasiva Rao Nandam; Ananda Rao Akepogu

Conference Proceedings

An iterative hadoop-based ensemble data classification model on distributed medical databases

Advances in Intelligent Systems and Computing (2017) 507 341-351

DOI: 10.1007/978-981-10-2471-9_33

5Citations

5Readers

Get full text

Abstract

As the size and complexity of the online biomedical databases are growing day by day, finding an essential structure or unstructured patterns in the distributed biomedical applications has become more complex. Traditional Hadoop-based distributed decision tree models such as Probability based decision tree (PDT), Classification And Regression Tree (CART) and Multiclass Classification Decision Tree have failed to discover relational patterns, user-specific patterns and feature-based patterns, due to the large number of feature sets. These models depend on selection of relevant attributes and uniform data distribution. Data imbalance, indexing and sparsity are the three major issues in these distributed decision tree models. In this proposed model, an enhanced attributes selection ranking model and Hadoop-based decision tree model were implemented to extract the user-specific interesting patterns in online biomedical databases. Experimental results show that the proposed model has high true positive, high precision and low error rate compared to traditional distributed decision tree models.

Author supplied keywords

Cite

CITATION STYLE

APA

Bikku, T., Nandam, S. R., & Akepogu, A. R. (2017). An iterative hadoop-based ensemble data classification model on distributed medical databases. In Advances in Intelligent Systems and Computing (Vol. 507, pp. 341–351). Springer Verlag. https://doi.org/10.1007/978-981-10-2471-9_33

An iterative hadoop-based ensemble data classification model on distributed medical databases

Abstract

Author supplied keywords

Cite

Register to see more suggestions