Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment
Background: Online training has gained popularity as an effective teaching method, necessitating diligent monitoring of learner progress and engagement. The challenge of predicting academic performance in online courses is crucial for supporting learners at risk of academic loss. This study aimed to...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Shiraz University of Medical Sciences
2024-12-01
|
| Series: | Interdisciplinary Journal of Virtual Learning in Medical Sciences |
| Subjects: | |
| Online Access: | https://ijvlms.sums.ac.ir/article_50587_e403e00e208e76873b73eab4816479b6.pdf |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1846098902853353472 |
|---|---|
| author | Seyede Fatemeh Noorani Maryam Karimi Zahra Gholijafari |
| author_facet | Seyede Fatemeh Noorani Maryam Karimi Zahra Gholijafari |
| author_sort | Seyede Fatemeh Noorani |
| collection | DOAJ |
| description | Background: Online training has gained popularity as an effective teaching method, necessitating diligent monitoring of learner progress and engagement. The challenge of predicting academic performance in online courses is crucial for supporting learners at risk of academic loss. This study aimed to develop a robust model for predicting learners' performance using ensemble machine learning and feature engineering techniques.Methods: This research employed a classification approach based on the Digital Electronic Education and Design Suite (DEEDS) dataset, which records real-time interactions of learners within an online educational environment. The dataset analyzed in this research included activity logs from 115 undergraduate students majoring in computer engineering who participated in a digital electronics course at the University of Genoa, Italy, between September and December 2015. Various machine learning algorithms, including Random Forest (RF), Adaptive Boosting (AdaBoost), Gradient Boosting (GB), Light Gradient-Boosting Machine (LightGBM), and eXtreme Gradient Boosting (XGBoost), were applied. The study also utilized ensemble learning methods such as Boosting and Stacking to enhance prediction accuracy. Feature engineering techniques were implemented to extract and select relevant features from the dataset, leading to the development of a predictive model.Results: The proposed model achieved an accuracy of 97.43%, a precision of 96.20%, and an F1-score of 98.06%, indicating an acceptable predictive capability. Notably, the findings revealed that feature selection significantly enhanced performance; in the absence of feature selection, the accuracy dropped to 92.15%. Additionally, ensemble methods like Boosting and Stacking provided a 15% enhancement in prediction accuracy compared to traditional approaches. Overall, the integration of feature engineering and ensemble techniques acceptably optimized the model's ability to predict learners’ academic performance in online educational settings. Conclusion: This research validates the effectiveness of employing ensemble machine learning techniques and feature engineering in predicting learners’ academic performance in online education. Future studies should explore additional ensemble methods and incorporate diverse feature types to enhance prediction accuracy. |
| format | Article |
| id | doaj-art-0f74a5d1d1a442f59a23e3c9d9024c16 |
| institution | Kabale University |
| issn | 2476-7263 2476-7271 |
| language | English |
| publishDate | 2024-12-01 |
| publisher | Shiraz University of Medical Sciences |
| record_format | Article |
| series | Interdisciplinary Journal of Virtual Learning in Medical Sciences |
| spelling | doaj-art-0f74a5d1d1a442f59a23e3c9d9024c162025-01-01T06:48:21ZengShiraz University of Medical SciencesInterdisciplinary Journal of Virtual Learning in Medical Sciences2476-72632476-72712024-12-0115436938710.30476/ijvlms.2024.101157.127950587Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational EnvironmentSeyede Fatemeh Noorani0Maryam Karimi1Zahra Gholijafari2Department of Information Technology and Computer Engineering, Payame Noor University, Tehran, IranDepartment of Computer Sciences, Faculty of Mathematical Sciences, Shahrkord, IranDepartment of Information Technology and Computer Engineering, Payame Noor University, Tehran, IranBackground: Online training has gained popularity as an effective teaching method, necessitating diligent monitoring of learner progress and engagement. The challenge of predicting academic performance in online courses is crucial for supporting learners at risk of academic loss. This study aimed to develop a robust model for predicting learners' performance using ensemble machine learning and feature engineering techniques.Methods: This research employed a classification approach based on the Digital Electronic Education and Design Suite (DEEDS) dataset, which records real-time interactions of learners within an online educational environment. The dataset analyzed in this research included activity logs from 115 undergraduate students majoring in computer engineering who participated in a digital electronics course at the University of Genoa, Italy, between September and December 2015. Various machine learning algorithms, including Random Forest (RF), Adaptive Boosting (AdaBoost), Gradient Boosting (GB), Light Gradient-Boosting Machine (LightGBM), and eXtreme Gradient Boosting (XGBoost), were applied. The study also utilized ensemble learning methods such as Boosting and Stacking to enhance prediction accuracy. Feature engineering techniques were implemented to extract and select relevant features from the dataset, leading to the development of a predictive model.Results: The proposed model achieved an accuracy of 97.43%, a precision of 96.20%, and an F1-score of 98.06%, indicating an acceptable predictive capability. Notably, the findings revealed that feature selection significantly enhanced performance; in the absence of feature selection, the accuracy dropped to 92.15%. Additionally, ensemble methods like Boosting and Stacking provided a 15% enhancement in prediction accuracy compared to traditional approaches. Overall, the integration of feature engineering and ensemble techniques acceptably optimized the model's ability to predict learners’ academic performance in online educational settings. Conclusion: This research validates the effectiveness of employing ensemble machine learning techniques and feature engineering in predicting learners’ academic performance in online education. Future studies should explore additional ensemble methods and incorporate diverse feature types to enhance prediction accuracy.https://ijvlms.sums.ac.ir/article_50587_e403e00e208e76873b73eab4816479b6.pdfinformation sciencesupervised machine learningeducationaldata miningdimensionality reductioncomputer-assisted |
| spellingShingle | Seyede Fatemeh Noorani Maryam Karimi Zahra Gholijafari Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment Interdisciplinary Journal of Virtual Learning in Medical Sciences information science supervised machine learning educational data mining dimensionality reduction computer-assisted |
| title | Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment |
| title_full | Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment |
| title_fullStr | Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment |
| title_full_unstemmed | Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment |
| title_short | Using Ensemble Machine Learning and Feature Engineering to Increase the Accuracy of Predicting Learners' Performance in an Online Educational Environment |
| title_sort | using ensemble machine learning and feature engineering to increase the accuracy of predicting learners performance in an online educational environment |
| topic | information science supervised machine learning educational data mining dimensionality reduction computer-assisted |
| url | https://ijvlms.sums.ac.ir/article_50587_e403e00e208e76873b73eab4816479b6.pdf |
| work_keys_str_mv | AT seyedefatemehnoorani usingensemblemachinelearningandfeatureengineeringtoincreasetheaccuracyofpredictinglearnersperformanceinanonlineeducationalenvironment AT maryamkarimi usingensemblemachinelearningandfeatureengineeringtoincreasetheaccuracyofpredictinglearnersperformanceinanonlineeducationalenvironment AT zahragholijafari usingensemblemachinelearningandfeatureengineeringtoincreasetheaccuracyofpredictinglearnersperformanceinanonlineeducationalenvironment |