Prediction of Myopia Among Undergraduate Students Using Ensemble Machine Learning Techniques

ABSTRACT Background and Aims Myopia is a prevalent refractive error, particularly among young adults, and is becoming a growing global concern. This study aims to predict myopia among undergraduate students using ensemble machine learning techniques and to identify key risk factors associated with i...

Full description

Saved in:
Bibliographic Details
Main Authors: Isteaq Kabir Sifat, Tajin Ahmed Jisa, Jyoti Shree Roy, Nourin Sultana, Farhana Hasan, Md Parvez Mosharaf, Md. Kaderi Kibria
Format: Article
Language:English
Published: Wiley 2025-05-01
Series:Health Science Reports
Subjects:
Online Access:https://doi.org/10.1002/hsr2.70874
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:ABSTRACT Background and Aims Myopia is a prevalent refractive error, particularly among young adults, and is becoming a growing global concern. This study aims to predict myopia among undergraduate students using ensemble machine learning techniques and to identify key risk factors associated with its development. Methods A cross‐sectional study was conducted in Dinajpur city, collecting 514 samples through a self‐structured questionnaire covering demographic information, myopia prevalence and risk factors, knowledge and attitudes, and daily activities. Four feature selection techniques Boruta‐based feature selection (BFS), Least Absolute Shrinkage and Selection Operator regression, Forward and Backward Selection and Random Forest (RF) identified 12 key predictive features. Using these features, ensemble methods, including logistic regression artificial neural network, RF, Support Vector Machine, extreme gradient boosting, and light gradient boosting machine were employed for prediction. Model performance was evaluated using accuracy, precision, recall, F1‐score, and area under the curve (AUC). Results The stacking ensemble model achieved the highest performance, with an accuracy of 95.42%, recall of 93.42%, precision of 98.85%, F1‐score of 96.08%, and AUC of 0.979. SHapley Additive exPlanations analysis identified key risk factors, including visual impairment, family history of myopia, excessive screen time, and insufficient outdoor activities. Conclusion These findings demonstrate the effectiveness of ensemble machine learning in predicting myopia and highlight the potential for early intervention strategies. By identifying high‐risk individuals, targeted awareness programs and lifestyle modifications can help mitigate myopia progression among undergraduate students.
ISSN:2398-8835