Predicting Student Dropout in E-Learning Using Simple Machine Learning and Explainable Data Analysis

Authors

  • Aliyu Ibrahim Ahmad Universitas Islam Negeri K.H. Abdurrahman Wahid Pekalongan, Pekalongan
  • D.A Nugroho Universitas Islam Negeri K.H. Abdurrahman Wahid Pekalongan, Pekalongan
  • S.A. Aliyu Federal University Dutse, Dutse
  • A.M. Abdullahi Noida International University, Delhi

DOI:

https://doi.org/10.28918/logiclink.v2i2.13116

Abstract

Online learning allows a lot of flexibility and accessibility, but the issue of student dropout becomes one of the key problems. Much of the research that has been done to predict dropout using complex machine learning methods has not been interpretable, and many of these methods are hard to implement in practice, especially in resource constrained environments. This research closes this gap by presenting a simple and explainable machine learning method of predicting early dropout. Students in several universities were surveyed on the frequency of attendance, quiz performance, completion of assignments, satisfaction with learning, and hours spent studying per week. Because there were only a few real dropout cases, a controlled synthetic data augmentation method was used to demonstrate and train the model. The use of Logistic Regression and Decision Tree classifiers was used to predict the risk of dropout. The accuracy of both models was 87.5% with the most effective indicators being learning satisfaction, attendance, and prior consideration of dropout. This research is considered new because it focuses on the simplicity and interpretability of models rather than the complexity of creating early warning systems to predict the dropout of students, showing that they are not required to be complex or made out of heavyweight models to perform well. Even though the findings are merely exploratory due to limitations of the datasets, the results show that even simplistic models might be used to assist educators in identifying at-risk learners and incorporating prompt intervention measures.

Keywords:

e-learning, student dropout, machine learning, interpretable analysis, student retention

References

Aldowah, H., Ul Rehman, S., Ghazal, S., & Umar, I. N. (2020). Predicting student performance in MOOCs using machine learning methods. IEEE Access, 8, 106420–106433.

Al-Shabandar, R., Hussain, A., Liatsis, P., & Keight, R. (2020). Predicting student performance in MOOCs using machine learning models. Computers in Human Behavior, 107, 105–115.

Ashraf, M., Abdullah, S., & Arshad, S. (2021). Student academic performance prediction using interpretable machine learning models. International Journal of Emerging Technologies in Learning, 16(9), 4–19.

Baker, R. S., & Hawn, A. (2021). Algorithmic bias in education. International Journal of Artificial Intelligence in Education, 31(3), 1–26.

Cui, Y., Zhang, Y., & Yang, L. (2022). Explainable machine learning for early warning systems in online education. IEEE Access, 10, 20539–20550.

Dawson, S., Gašević, D., Siemens, G., & Joksimovic, S. (2020). Current state and future trends in learning analytics. Journal of Learning Analytics, 7(3), 43–57.

Fan, C., Wang, J., & Zhang, X. (2021). Interpretable machine learning in education: A systematic review. Education and Information Technologies, 26(4), 1–22. https://link.springer.com/article/10.1007/s10639-021-10409-8

He, J., Bailey, J., Rubinstein, B. I. P., & Zhang, R. (2021). Identifying at-risk students using machine learning techniques. IEEE Transactions on Learning Technologies, 14(1), 1–13.

Khalil, M., Prinsloo, P., & Slade, S. (2022). Ethics and learning analytics: Student perspectives. British Journal of Educational Technology, 53(2), 1–15.

Klašnja-Milićević, A., Ivanović, M., & Budimac, Z. (2021). Data-driven learning analytics in education. Computers & Education, 160, 104020.

Kumar, V., Singh, S., & Sharma, A. (2021). Predicting student dropout in e-learning: A machine learning approach. Journal of Educational Technology Systems, 49(4), 529–550.

Mubarak, A. A., Cao, H., & Zhang, W. (2021). Prediction of students’ early dropout using machine learning techniques. IEEE Access, 9, 42775–42786.

Ndukwe, I. G., & Daniel, B. K. (2020). Teaching analytics: A tool for teacher development. Education and Information Technologies, 25(3), 1–20.

Priya, P., & Sridevi, S. (2022). A comprehensive survey on student dropout prediction models. International Journal of Information Management Data Insights, 2(1), 100087.

Stiglic, G., Kocbek, P., Pernek, I., & Kokol, P. (2020). Interpretability of machine learning models in education. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 10(5), e1374.

Wong, B. T.-M., & Li, K. C. (2021). Learning analytics intervention in higher education: A review. Interactive Learning Environments, 29(8), 1295–1310.

Xing, W., & Du, D. (2022). Dropout prediction in online learning: A data-driven and explainable approach. International Journal of Emerging Technologies in Learning, 17(6), 15–29. https://www.online-journals.org/index.php/i-jet/article/view/28295

Zhang, Q., Wang, Y., & Chen, L. (2023). Explainable artificial intelligence for dropout prediction in online education. Expert Systems with Applications, 213, 118129. https://doi.org/10.1016/j.eswa.2022.118129

Downloads

Published

2025-12-29

Article Statistics

201 Views
115 Downloads

Issue

Section

Articles

How to Cite

Predicting Student Dropout in E-Learning Using Simple Machine Learning and Explainable Data Analysis. (2025). LogicLink, 2(2), 138 – 148. https://doi.org/10.28918/logiclink.v2i2.13116