An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorptio...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2024-12-01
|
| Series: | Results in Engineering |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2590123024017778 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1846115739196456960 |
|---|---|
| author | Mahmood Yousefi Vahide Oskoei Hamid Reza Esmaeli Mansour Baziar |
| author_facet | Mahmood Yousefi Vahide Oskoei Hamid Reza Esmaeli Mansour Baziar |
| author_sort | Mahmood Yousefi |
| collection | DOAJ |
| description | The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorption ratio (SAR) using data-driven techniques. It explores the AdaBoost algorithm and a hybrid model that combines AdaBoost with Extra Trees (ET). The research centers on the Sarayan region in southern Khorasan, Iran, involving the collection and analysis of groundwater quality data, which includes both physical and chemical parameters, spanning a period of four years. Pearson correlation analysis is utilized in this study to determine the critical input variables for predicting Sodium Adsorption Ratio (SAR) and Potential Salinity (PS). Significant performance improvements are demonstrated through the optimization of AdaBoost and the hybrid AdaBoost-Extra Trees (ET) models, achieved via grid search and Bayesian optimization. The results indicate that the hybrid model surpasses AdaBoost in performance. Specifically, the optimized AdaBoost model achieves a Test MSE of 3.57 and a Test R² of 0.87 for SAR prediction, whereas the hybrid model obtains a significantly better Test MSE of 0.87 and Test R² of 0.97. For the prediction of PS, the optimized AdaBoost model yields a Test MSE of 0.223 and a Test R² of 0.976, while the hybrid model delivers a significantly improved Test MSE of 0.018 and Test R² of 0.998. Feature importance analysis identifies critical patterns in the relevance of input parameters. This study underscores the efficacy of hybrid data-driven models in water quality prediction, showcasing their superior accuracy. |
| format | Article |
| id | doaj-art-aa7488a951ce4ce88a230b683819b5d7 |
| institution | Kabale University |
| issn | 2590-1230 |
| language | English |
| publishDate | 2024-12-01 |
| publisher | Elsevier |
| record_format | Article |
| series | Results in Engineering |
| spelling | doaj-art-aa7488a951ce4ce88a230b683819b5d72024-12-19T10:59:53ZengElsevierResults in Engineering2590-12302024-12-0124103534An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indicesMahmood Yousefi0Vahide Oskoei1Hamid Reza Esmaeli2Mansour Baziar3Department of Environmental Health Engineering, School of Public Health, Khoy University of Medical Sciences, Khoy, IranSchool of Life and Environmental Science, Deakin University, Geelong, AustraliaDepartment of Environmental Health Engineering, Ferdows Faculty of Medical Sciences, Birjand University of Medical Sciences, Birjand, IranDepartment of Environmental Health Engineering, Ferdows Faculty of Medical Sciences, Birjand University of Medical Sciences, Birjand, Iran; Medical Toxicology and Drug abuse Research Center (MTDRC), Birjand University of Medical Sciences (BUMS), Birjand, Iran; Corresponding author.The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorption ratio (SAR) using data-driven techniques. It explores the AdaBoost algorithm and a hybrid model that combines AdaBoost with Extra Trees (ET). The research centers on the Sarayan region in southern Khorasan, Iran, involving the collection and analysis of groundwater quality data, which includes both physical and chemical parameters, spanning a period of four years. Pearson correlation analysis is utilized in this study to determine the critical input variables for predicting Sodium Adsorption Ratio (SAR) and Potential Salinity (PS). Significant performance improvements are demonstrated through the optimization of AdaBoost and the hybrid AdaBoost-Extra Trees (ET) models, achieved via grid search and Bayesian optimization. The results indicate that the hybrid model surpasses AdaBoost in performance. Specifically, the optimized AdaBoost model achieves a Test MSE of 3.57 and a Test R² of 0.87 for SAR prediction, whereas the hybrid model obtains a significantly better Test MSE of 0.87 and Test R² of 0.97. For the prediction of PS, the optimized AdaBoost model yields a Test MSE of 0.223 and a Test R² of 0.976, while the hybrid model delivers a significantly improved Test MSE of 0.018 and Test R² of 0.998. Feature importance analysis identifies critical patterns in the relevance of input parameters. This study underscores the efficacy of hybrid data-driven models in water quality prediction, showcasing their superior accuracy.http://www.sciencedirect.com/science/article/pii/S2590123024017778Water quality predictionBayesian optimizationHybrid AdaBoostGroundwaterGrid search |
| spellingShingle | Mahmood Yousefi Vahide Oskoei Hamid Reza Esmaeli Mansour Baziar An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices Results in Engineering Water quality prediction Bayesian optimization Hybrid AdaBoost Groundwater Grid search |
| title | An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices |
| title_full | An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices |
| title_fullStr | An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices |
| title_full_unstemmed | An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices |
| title_short | An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices |
| title_sort | innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices |
| topic | Water quality prediction Bayesian optimization Hybrid AdaBoost Groundwater Grid search |
| url | http://www.sciencedirect.com/science/article/pii/S2590123024017778 |
| work_keys_str_mv | AT mahmoodyousefi aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT vahideoskoei aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT hamidrezaesmaeli aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT mansourbaziar aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT mahmoodyousefi innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT vahideoskoei innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT hamidrezaesmaeli innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices AT mansourbaziar innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices |