On the advantage of using dedicated data mining techniques to predict colorectal cancer

13Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Electronic Medical Records (EMRs) provide a wealth of data that can be used to generate predictive models for diseases. Quite some studies have been performed that use EMRs to generate such models for specific diseases, but most of them are based on more traditional techniques used in medical domain, such as logistic regression. This paper studies the benefit of using advanced data mining techniques for Colorectal Cancer (CRC). CRC is the second most common cancer in the EU and is known to be a disease with very a-specific predictors, making it difficult to generate good predictive models. In addition, the EMR data itself has its own challenges, including the sparsity, the differences in which physicians code the data, the temporal nature of the data, and the imbalance in the data. Results show that state-of-the-art data mining techniques, including temporal data mining, are able to generate better predictive models than currently available in the literature.

Cite

CITATION STYLE

APA

Kop, R., Hoogendoorn, M., Moons, L. M. G., Numans, M. E., & ten Teije, A. (2015). On the advantage of using dedicated data mining techniques to predict colorectal cancer. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9105, pp. 133–142). Springer Verlag. https://doi.org/10.1007/978-3-319-19551-3_16

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free