An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices

The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorptio...

Full description

Saved in:
Bibliographic Details
Main Authors: Mahmood Yousefi, Vahide Oskoei, Hamid Reza Esmaeli, Mansour Baziar
Format: Article
Language:English
Published: Elsevier 2024-12-01
Series:Results in Engineering
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2590123024017778
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846115739196456960
author Mahmood Yousefi
Vahide Oskoei
Hamid Reza Esmaeli
Mansour Baziar
author_facet Mahmood Yousefi
Vahide Oskoei
Hamid Reza Esmaeli
Mansour Baziar
author_sort Mahmood Yousefi
collection DOAJ
description The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorption ratio (SAR) using data-driven techniques. It explores the AdaBoost algorithm and a hybrid model that combines AdaBoost with Extra Trees (ET). The research centers on the Sarayan region in southern Khorasan, Iran, involving the collection and analysis of groundwater quality data, which includes both physical and chemical parameters, spanning a period of four years. Pearson correlation analysis is utilized in this study to determine the critical input variables for predicting Sodium Adsorption Ratio (SAR) and Potential Salinity (PS). Significant performance improvements are demonstrated through the optimization of AdaBoost and the hybrid AdaBoost-Extra Trees (ET) models, achieved via grid search and Bayesian optimization. The results indicate that the hybrid model surpasses AdaBoost in performance. Specifically, the optimized AdaBoost model achieves a Test MSE of 3.57 and a Test R² of 0.87 for SAR prediction, whereas the hybrid model obtains a significantly better Test MSE of 0.87 and Test R² of 0.97. For the prediction of PS, the optimized AdaBoost model yields a Test MSE of 0.223 and a Test R² of 0.976, while the hybrid model delivers a significantly improved Test MSE of 0.018 and Test R² of 0.998. Feature importance analysis identifies critical patterns in the relevance of input parameters. This study underscores the efficacy of hybrid data-driven models in water quality prediction, showcasing their superior accuracy.
format Article
id doaj-art-aa7488a951ce4ce88a230b683819b5d7
institution Kabale University
issn 2590-1230
language English
publishDate 2024-12-01
publisher Elsevier
record_format Article
series Results in Engineering
spelling doaj-art-aa7488a951ce4ce88a230b683819b5d72024-12-19T10:59:53ZengElsevierResults in Engineering2590-12302024-12-0124103534An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indicesMahmood Yousefi0Vahide Oskoei1Hamid Reza Esmaeli2Mansour Baziar3Department of Environmental Health Engineering, School of Public Health, Khoy University of Medical Sciences, Khoy, IranSchool of Life and Environmental Science, Deakin University, Geelong, AustraliaDepartment of Environmental Health Engineering, Ferdows Faculty of Medical Sciences, Birjand University of Medical Sciences, Birjand, IranDepartment of Environmental Health Engineering, Ferdows Faculty of Medical Sciences, Birjand University of Medical Sciences, Birjand, Iran; Medical Toxicology and Drug abuse Research Center (MTDRC), Birjand University of Medical Sciences (BUMS), Birjand, Iran; Corresponding author.The management of water quality plays a pivotal role in fostering sustainable development, especially in areas where groundwater serves as a vital resource for agricultural purposes. This study investigates methods to predict water quality indicators like potential salinity (PS) and sodium adsorption ratio (SAR) using data-driven techniques. It explores the AdaBoost algorithm and a hybrid model that combines AdaBoost with Extra Trees (ET). The research centers on the Sarayan region in southern Khorasan, Iran, involving the collection and analysis of groundwater quality data, which includes both physical and chemical parameters, spanning a period of four years. Pearson correlation analysis is utilized in this study to determine the critical input variables for predicting Sodium Adsorption Ratio (SAR) and Potential Salinity (PS). Significant performance improvements are demonstrated through the optimization of AdaBoost and the hybrid AdaBoost-Extra Trees (ET) models, achieved via grid search and Bayesian optimization. The results indicate that the hybrid model surpasses AdaBoost in performance. Specifically, the optimized AdaBoost model achieves a Test MSE of 3.57 and a Test R² of 0.87 for SAR prediction, whereas the hybrid model obtains a significantly better Test MSE of 0.87 and Test R² of 0.97. For the prediction of PS, the optimized AdaBoost model yields a Test MSE of 0.223 and a Test R² of 0.976, while the hybrid model delivers a significantly improved Test MSE of 0.018 and Test R² of 0.998. Feature importance analysis identifies critical patterns in the relevance of input parameters. This study underscores the efficacy of hybrid data-driven models in water quality prediction, showcasing their superior accuracy.http://www.sciencedirect.com/science/article/pii/S2590123024017778Water quality predictionBayesian optimizationHybrid AdaBoostGroundwaterGrid search
spellingShingle Mahmood Yousefi
Vahide Oskoei
Hamid Reza Esmaeli
Mansour Baziar
An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
Results in Engineering
Water quality prediction
Bayesian optimization
Hybrid AdaBoost
Groundwater
Grid search
title An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
title_full An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
title_fullStr An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
title_full_unstemmed An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
title_short An innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
title_sort innovative combination of extra trees within adaboost for accurate prediction of agricultural water quality indices
topic Water quality prediction
Bayesian optimization
Hybrid AdaBoost
Groundwater
Grid search
url http://www.sciencedirect.com/science/article/pii/S2590123024017778
work_keys_str_mv AT mahmoodyousefi aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT vahideoskoei aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT hamidrezaesmaeli aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT mansourbaziar aninnovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT mahmoodyousefi innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT vahideoskoei innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT hamidrezaesmaeli innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices
AT mansourbaziar innovativecombinationofextratreeswithinadaboostforaccuratepredictionofagriculturalwaterqualityindices