Abstract
In this paper, the effectiveness of using random forest algorithm in the big data is studied. The reason for choosing this algorithm is because of its effective results in many previous studies, so it was chosen. The random forest algorithm was applied to the big data Internet of Things (IoT) dataset, with size 150,000 instances. After applying the algorithm, it gave an Accuracy score of 99.976%, and this indicates the effectiveness of the random forest algorithm in the big data. This result made the researcher search for one of the ways that increases the value of Accuracy, knowing that it is an excellent result. And use a filter to remove frequent values, and this helps reduce data volume and keep related data. After applying the filter with the random forest algorithm, the Accuracy value appeared at 99.998%, which indicates that it improved the random forest algorithm's performance in the big data.
Author supplied keywords
Cite
CITATION STYLE
Saadoon, Y. A., & Abdulamir, R. H. (2021). Improved Random Forest Algorithm Performance for Big Data. In Journal of Physics: Conference Series (Vol. 1897). IOP Publishing Ltd. https://doi.org/10.1088/1742-6596/1897/1/012071
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.