Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability

Accurate prediction of power output from a photovoltaic (PV) system is crucial for ensuring operational efficiency. This study addresses the challenge of predicting plant-scale PV power output by integrating automated machine learning (Auto-ML) with explainable modeling techniques. The integrated ap...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad Paend Bakht, Mohd Norzali Haji Mohd, Babul Salam KSM Kader Ibrahim, Nuzhat Khan, Usman Ullah Sheikh, Ab Al-Hadi Ab Rahman
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Results in Engineering
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2590123024020814
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841553813773746176
author Muhammad Paend Bakht
Mohd Norzali Haji Mohd
Babul Salam KSM Kader Ibrahim
Nuzhat Khan
Usman Ullah Sheikh
Ab Al-Hadi Ab Rahman
author_facet Muhammad Paend Bakht
Mohd Norzali Haji Mohd
Babul Salam KSM Kader Ibrahim
Nuzhat Khan
Usman Ullah Sheikh
Ab Al-Hadi Ab Rahman
author_sort Muhammad Paend Bakht
collection DOAJ
description Accurate prediction of power output from a photovoltaic (PV) system is crucial for ensuring operational efficiency. This study addresses the challenge of predicting plant-scale PV power output by integrating automated machine learning (Auto-ML) with explainable modeling techniques. The integrated approach enhances predictive accuracy, supporting well-informed decision-making in power systems through data-driven frameworks. Real PV power data from a plant at Universiti Tun Hussein Onn Malaysia (UTHM) and five key weather parameters were used in this experiment. Auto-ML was employed to automatically identify the best-performing models tailored to the dataset. The top four performing models, achieving the highest predictive accuracies, were identified as Extra Tree (91% accuracy), Random Forest (85%), XGBoost (75%), and Decision Tree (68%) for further analysis. Their performance was then validated against commonly used artificial neural networks (ANN) and support vector machines (SVM) using multiple evaluation metrics including prediction accuracy, error rates, and interpretability. The results clearly demonstrate the superiority of the proposed approach across all performance metrics. For practical applications, a novel data mining method is also proposed to identify primary environmental drivers of PV performance using bivariate data analysis. Additionally, the model-based role of each parameter in the machine learning (ML) context is assessed using additivity of feature importance to uncover the underlying predictive mechanism of each ML model. This study establishes an advanced and powerful framework combining Auto-ML and explainable AI for predictive modeling of PV power output. It sets new standards for significantly improved operational decisions and a broader integration of AI in renewable energy forecasting for data-driven optimization in power systems.
format Article
id doaj-art-56bf6ef0eb8a42da80bb9b73ac8d8a84
institution Kabale University
issn 2590-1230
language English
publishDate 2025-03-01
publisher Elsevier
record_format Article
series Results in Engineering
spelling doaj-art-56bf6ef0eb8a42da80bb9b73ac8d8a842025-01-09T06:14:29ZengElsevierResults in Engineering2590-12302025-03-0125103838Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretabilityMuhammad Paend Bakht0Mohd Norzali Haji Mohd1Babul Salam KSM Kader Ibrahim2Nuzhat Khan3Usman Ullah Sheikh4Ab Al-Hadi Ab Rahman5Faculty of Electrical and Electronic Engineering, Universiti Tun Hussein Onn Malaysia, Parit Raja, 86400, Malaysia; Department of Electrical Engineering, Balochistan University of Information Technology Engineering and Management Sciences, Quetta, 87300, PakistanFaculty of Electrical and Electronic Engineering, Universiti Tun Hussein Onn Malaysia, Parit Raja, 86400, MalaysiaGUST Engineering & Applied Innovation Research Centre (GEAR), Gulf University for Science & Technology, Hawally, 7207, Kuwait; Corresponding authors.Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru, 81300, Johor, Malaysia; Corresponding authors.Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru, 81300, Johor, Malaysia; Corresponding authors.Faculty of Electrical Engineering, Universiti Teknologi Malaysia, Johor Bahru, 81300, Johor, MalaysiaAccurate prediction of power output from a photovoltaic (PV) system is crucial for ensuring operational efficiency. This study addresses the challenge of predicting plant-scale PV power output by integrating automated machine learning (Auto-ML) with explainable modeling techniques. The integrated approach enhances predictive accuracy, supporting well-informed decision-making in power systems through data-driven frameworks. Real PV power data from a plant at Universiti Tun Hussein Onn Malaysia (UTHM) and five key weather parameters were used in this experiment. Auto-ML was employed to automatically identify the best-performing models tailored to the dataset. The top four performing models, achieving the highest predictive accuracies, were identified as Extra Tree (91% accuracy), Random Forest (85%), XGBoost (75%), and Decision Tree (68%) for further analysis. Their performance was then validated against commonly used artificial neural networks (ANN) and support vector machines (SVM) using multiple evaluation metrics including prediction accuracy, error rates, and interpretability. The results clearly demonstrate the superiority of the proposed approach across all performance metrics. For practical applications, a novel data mining method is also proposed to identify primary environmental drivers of PV performance using bivariate data analysis. Additionally, the model-based role of each parameter in the machine learning (ML) context is assessed using additivity of feature importance to uncover the underlying predictive mechanism of each ML model. This study establishes an advanced and powerful framework combining Auto-ML and explainable AI for predictive modeling of PV power output. It sets new standards for significantly improved operational decisions and a broader integration of AI in renewable energy forecasting for data-driven optimization in power systems.http://www.sciencedirect.com/science/article/pii/S2590123024020814Solar photovoltaicMachine learningArtificial intelligencePower predictionSustainable
spellingShingle Muhammad Paend Bakht
Mohd Norzali Haji Mohd
Babul Salam KSM Kader Ibrahim
Nuzhat Khan
Usman Ullah Sheikh
Ab Al-Hadi Ab Rahman
Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability
Results in Engineering
Solar photovoltaic
Machine learning
Artificial intelligence
Power prediction
Sustainable
title Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability
title_full Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability
title_fullStr Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability
title_full_unstemmed Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability
title_short Advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and SHAP interpretability
title_sort advanced automated machine learning framework for photovoltaic power output prediction using environmental parameters and shap interpretability
topic Solar photovoltaic
Machine learning
Artificial intelligence
Power prediction
Sustainable
url http://www.sciencedirect.com/science/article/pii/S2590123024020814
work_keys_str_mv AT muhammadpaendbakht advancedautomatedmachinelearningframeworkforphotovoltaicpoweroutputpredictionusingenvironmentalparametersandshapinterpretability
AT mohdnorzalihajimohd advancedautomatedmachinelearningframeworkforphotovoltaicpoweroutputpredictionusingenvironmentalparametersandshapinterpretability
AT babulsalamksmkaderibrahim advancedautomatedmachinelearningframeworkforphotovoltaicpoweroutputpredictionusingenvironmentalparametersandshapinterpretability
AT nuzhatkhan advancedautomatedmachinelearningframeworkforphotovoltaicpoweroutputpredictionusingenvironmentalparametersandshapinterpretability
AT usmanullahsheikh advancedautomatedmachinelearningframeworkforphotovoltaicpoweroutputpredictionusingenvironmentalparametersandshapinterpretability
AT abalhadiabrahman advancedautomatedmachinelearningframeworkforphotovoltaicpoweroutputpredictionusingenvironmentalparametersandshapinterpretability