Random Forest

Keisuke Yamaoka

Journal Article, Open Access

The Journal of The Institute of Image Information and Television Engineers (2012) 66(7) 573-575

DOI: 10.3169/itej.66.573

Abstract

To analyze survival data and identify risk factors associated with mortality, physicians, researchers, and biostatisticians have typically relied on regression techniques, most notably the Cox model. As computing power has become more widely available, methods that require more complex mathematics have become increasingly common, and in this era of "big data" and machine learning the methodology of survival analysis has broadened. This paper explores one such technique, Random Forest. Random Forest is a tree-based ensemble technique that combines bootstrap aggregation with randomization of predictors to achieve a high degree of predictive accuracy. The input parameters of the random forest are explored. Colon cancer data (n = 66,807) from the SEER database are then used to fit both a Cox model and a random forest model and to compare how well the two perform on the same data. Both models perform well, each achieving a concordance error rate of approximately 18%.
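The concordance error rate quoted in the abstract is commonly taken to be one minus Harrell's concordance index (C-index); the abstract does not state the exact definition used, so that reading is an assumption here. The following is a minimal sketch of how such an error could be computed for right-censored data. The function name `concordance_error`, the toy data, and the choice to skip tied survival times are all illustrative and not taken from the paper.

```python
from itertools import combinations


def concordance_error(times, events, risk_scores):
    """Return 1 - Harrell's C for right-censored survival data.

    times       : observed follow-up time per subject
    events      : 1/True if death was observed, 0/False if censored
    risk_scores : model output, higher means higher predicted risk

    Simplification (assumption): pairs with tied times are skipped.
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        # Order the pair so that subject `a` has the shorter observed time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        # A pair is comparable only if the earlier time is an observed event.
        if times[a] == times[b] or not events[a]:
            continue
        comparable += 1
        if risk_scores[a] > risk_scores[b]:
            concordant += 1.0  # earlier death was ranked as higher risk
        elif risk_scores[a] == risk_scores[b]:
            concordant += 0.5  # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return 1.0 - concordant / comparable


if __name__ == "__main__":
    # Hypothetical toy data: the third subject is censored.
    times = [2, 4, 6, 8]
    events = [1, 1, 0, 1]
    risk = [0.9, 0.4, 0.7, 0.1]
    print(round(concordance_error(times, events, risk), 3))
```

In the toy data, four of the five comparable pairs are ranked correctly, so the error is 0.2. Under this reading, an error near 18% means the model orders roughly 82% of comparable patient pairs correctly, where 50% corresponds to random ranking.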

Citation (APA)

Yamaoka, K. (2012). Random Forest. The Journal of The Institute of Image Information and Television Engineers, 66(7), 573–575. https://doi.org/10.3169/itej.66.573
