Learning Performance Prediction with Imbalanced Virtual Learning Environment Students’ Interactions Data

1Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.
Get full text

Abstract

One of the critical aspects in completing study in a virtual learning environment (VLE) is the student behavior when interacting with the system. However, in real cases, most of the student behavior data have imbalanced label distribution. This imbalanced dataset affects the model performance of machine learning algorithms significantly. This study attempts to examine several resampling methods such as random undersampling (RUS), oversampling with synthetic minority oversampling technique (SMOTE), and hybrid sampling (SMOTEENN) to resolve the imbalanced data issue. Several machine learning (ML) classifiers are employed to evaluate the efficiency of the resampling methods, including Naïve Bayes (NB), Logistic Regression (LR), and Random Forest (RF). The experiment results indicate that the performance of classifiers is improved utilizing more balanced dataset. Furthermore, the Random Forest classifier has accomplished the best result among all other models while using SMOTEENN as a resampling approach.

Cite

CITATION STYLE

APA

Chen, H. C., Prasetyo, E., Prayitno, Kusumawardani, S. S., Tseng, S. S., Kung, T. L., & Wang, K. Y. (2022). Learning Performance Prediction with Imbalanced Virtual Learning Environment Students’ Interactions Data. In Lecture Notes in Networks and Systems (Vol. 279, pp. 330–340). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-030-79728-7_33

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free