Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment

Background: Online training has gained popularity as an effective teaching method, necessitating diligent monitoring of learner progress and engagement. The challenge of predicting academic performance in online courses is crucial for supporting learners at risk of academic loss. This study aimed to...

Full description

Saved in:
Bibliographic Details
Main Authors: Seyede Fatemeh Noorani, Maryam Karimi, Zahra Gholijafari
Format: Article
Language:English
Published: Shiraz University of Medical Sciences 2024-12-01
Series:Interdisciplinary Journal of Virtual Learning in Medical Sciences
Subjects:
Online Access:https://ijvlms.sums.ac.ir/article_50587_e403e00e208e76873b73eab4816479b6.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846098902853353472
author Seyede Fatemeh Noorani
Maryam Karimi
Zahra Gholijafari
author_facet Seyede Fatemeh Noorani
Maryam Karimi
Zahra Gholijafari
author_sort Seyede Fatemeh Noorani
collection DOAJ
description Background: Online training has gained popularity as an effective teaching method, necessitating diligent monitoring of learner progress and engagement. The challenge of predicting academic performance in online courses is crucial for supporting learners at risk of academic loss. This study aimed to develop a robust model for predicting learners' performance using ensemble machine learning and feature engineering techniques.Methods: This research employed a classification approach based on the Digital Electronic Education and Design Suite (DEEDS) dataset, which records real-time interactions of learners within an online educational environment. The dataset analyzed in this research included activity logs from 115 undergraduate students majoring in computer engineering who participated in a digital electronics course at the University of Genoa, Italy, between September and December 2015. Various machine learning algorithms, including Random Forest (RF), Adaptive Boosting (AdaBoost), Gradient Boosting (GB), Light Gradient-Boosting Machine (LightGBM), and eXtreme Gradient Boosting (XGBoost), were applied. The study also utilized ensemble learning methods such as Boosting and Stacking to enhance prediction accuracy. Feature engineering techniques were implemented to extract and select relevant features from the dataset, leading to the development of a predictive model.Results: The proposed model achieved an accuracy of 97.43%, a precision of 96.20%, and an F1-score of 98.06%, indicating an acceptable predictive capability. Notably, the findings revealed that feature selection significantly enhanced performance; in the absence of feature selection, the accuracy dropped to 92.15%. Additionally, ensemble methods like Boosting and Stacking provided a 15% enhancement in prediction accuracy compared to traditional approaches. Overall, the integration of feature engineering and ensemble techniques acceptably optimized the model's ability to predict learners’ academic performance in online educational settings. Conclusion: This research validates the effectiveness of employing ensemble machine learning techniques and feature engineering in predicting learners’ academic performance in online education. Future studies should explore additional ensemble methods and incorporate diverse feature types to enhance prediction accuracy.
format Article
id doaj-art-0f74a5d1d1a442f59a23e3c9d9024c16
institution Kabale University
issn 2476-7263
2476-7271
language English
publishDate 2024-12-01
publisher Shiraz University of Medical Sciences
record_format Article
series Interdisciplinary Journal of Virtual Learning in Medical Sciences
spelling doaj-art-0f74a5d1d1a442f59a23e3c9d9024c162025-01-01T06:48:21ZengShiraz University of Medical SciencesInterdisciplinary Journal of Virtual Learning in Medical Sciences2476-72632476-72712024-12-0115436938710.30476/ijvlms.2024.101157.127950587Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational EnvironmentSeyede Fatemeh Noorani0Maryam Karimi1Zahra Gholijafari2Department of Information Technology and Computer Engineering, Payame Noor University, Tehran, IranDepartment of Computer Sciences, Faculty of Mathematical Sciences, Shahrkord, IranDepartment of Information Technology and Computer Engineering, Payame Noor University, Tehran, IranBackground: Online training has gained popularity as an effective teaching method, necessitating diligent monitoring of learner progress and engagement. The challenge of predicting academic performance in online courses is crucial for supporting learners at risk of academic loss. This study aimed to develop a robust model for predicting learners' performance using ensemble machine learning and feature engineering techniques.Methods: This research employed a classification approach based on the Digital Electronic Education and Design Suite (DEEDS) dataset, which records real-time interactions of learners within an online educational environment. The dataset analyzed in this research included activity logs from 115 undergraduate students majoring in computer engineering who participated in a digital electronics course at the University of Genoa, Italy, between September and December 2015. Various machine learning algorithms, including Random Forest (RF), Adaptive Boosting (AdaBoost), Gradient Boosting (GB), Light Gradient-Boosting Machine (LightGBM), and eXtreme Gradient Boosting (XGBoost), were applied. The study also utilized ensemble learning methods such as Boosting and Stacking to enhance prediction accuracy. Feature engineering techniques were implemented to extract and select relevant features from the dataset, leading to the development of a predictive model.Results: The proposed model achieved an accuracy of 97.43%, a precision of 96.20%, and an F1-score of 98.06%, indicating an acceptable predictive capability. Notably, the findings revealed that feature selection significantly enhanced performance; in the absence of feature selection, the accuracy dropped to 92.15%. Additionally, ensemble methods like Boosting and Stacking provided a 15% enhancement in prediction accuracy compared to traditional approaches. Overall, the integration of feature engineering and ensemble techniques acceptably optimized the model's ability to predict learners’ academic performance in online educational settings. Conclusion: This research validates the effectiveness of employing ensemble machine learning techniques and feature engineering in predicting learners’ academic performance in online education. Future studies should explore additional ensemble methods and incorporate diverse feature types to enhance prediction accuracy.https://ijvlms.sums.ac.ir/article_50587_e403e00e208e76873b73eab4816479b6.pdfinformation sciencesupervised machine learningeducationaldata miningdimensionality reductioncomputer-assisted
spellingShingle Seyede Fatemeh Noorani
Maryam Karimi
Zahra Gholijafari
Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
Interdisciplinary Journal of Virtual Learning in Medical Sciences
information science
supervised machine learning
educational
data mining
dimensionality reduction
computer-assisted
title Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
title_full Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
title_fullStr Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
title_full_unstemmed Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
title_short Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
title_sort using ensemble machine learning and feature engineering to increase the accuracy of predicting learners performance in an online educational environment
topic information science
supervised machine learning
educational
data mining
dimensionality reduction
computer-assisted
url https://ijvlms.sums.ac.ir/article_50587_e403e00e208e76873b73eab4816479b6.pdf
work_keys_str_mv AT seyedefatemehnoorani usingensemblemachinelearningandfeatureengineeringtoincreasetheaccuracyofpredictinglearnersperformanceinanonlineeducationalenvironment
AT maryamkarimi usingensemblemachinelearningandfeatureengineeringtoincreasetheaccuracyofpredictinglearnersperformanceinanonlineeducationalenvironment
AT zahragholijafari usingensemblemachinelearningandfeatureengineeringtoincreasetheaccuracyofpredictinglearnersperformanceinanonlineeducationalenvironment