Evaluating regional sustainable energy potential through hierarchical clustering and machine learning

Energy is an essential resource for sustaining daily life and achieving economic growth. The increase in global energy demand, combined with the adverse environmental impacts of fossil fuels, has highlighted the urgency of transitioning to sustainable energy sources. In large and heterogeneous count...

Full description

Saved in:
Bibliographic Details
Main Authors: Selen Avcı Azkeskin, Zerrin Aladağ
Format: Article
Language:English
Published: IOP Publishing 2025-01-01
Series:Environmental Research Communications
Subjects:
Online Access:https://doi.org/10.1088/2515-7620/ada2e5
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841556849643487232
author Selen Avcı Azkeskin
Zerrin Aladağ
author_facet Selen Avcı Azkeskin
Zerrin Aladağ
author_sort Selen Avcı Azkeskin
collection DOAJ
description Energy is an essential resource for sustaining daily life and achieving economic growth. The increase in global energy demand, combined with the adverse environmental impacts of fossil fuels, has highlighted the urgency of transitioning to sustainable energy sources. In large and heterogeneous countries like Türkiye, region-specific analyses of sustainable energy potential (SEP) are crucial for formulating effective policies and optimizing resource allocation. This study introduces a novel two-step hierarchical clustering and classification framework to evaluate the SEP of Türkiye’s provinces comprehensively. The framework combines fuzzy and crisp clustering methods to capture the complex relationships among socioeconomic, geographical, and renewable energy potential criteria. First, the Fuzzy C-Means (FCM) algorithm is employed to perform fuzzy clustering using three main criteria and three different distance metrics—Euclidean, Manhattan, and Minkowski—, resulting in 21 clustering scenarios. Membership degrees from the fuzzy clustering phase are then integrated into a new dataset, which undergoes crisp clustering using the K-Means algorithm. This approach provides both granular and definitive cluster structures, enabling a robust analysis of regional energy characteristics. To validate the clustering results, supervised classification methods—including K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost)—are utilized, alongside ensemble models based on RF and XGBoost. The classification results are compared with traditional clustering evaluation indices, such as the Silhouette and Calinski-Harabasz indices, demonstrating the feasibility of using classification models to assess clustering accuracy. This study’s key contributions lie in integrating clustering and classification methods systematically and providing actionable insights into which renewable energy sources are most suitable for each cluster. By tailoring policy recommendations to the unique characteristics of each cluster, this framework not only corroborates existing findings in the literature but also extends them by offering a practical methodology for regional energy planning.
format Article
id doaj-art-2b6d2f499554499fb6c0d8d3778f1a54
institution Kabale University
issn 2515-7620
language English
publishDate 2025-01-01
publisher IOP Publishing
record_format Article
series Environmental Research Communications
spelling doaj-art-2b6d2f499554499fb6c0d8d3778f1a542025-01-07T05:46:48ZengIOP PublishingEnvironmental Research Communications2515-76202025-01-017101500210.1088/2515-7620/ada2e5Evaluating regional sustainable energy potential through hierarchical clustering and machine learningSelen Avcı Azkeskin0https://orcid.org/0000-0001-7433-5696Zerrin Aladağ1Faculty of Engineering, Department of Industrial Engineering, Kocaeli University , Kocaeli, 41380, TürkiyeFaculty of Engineering and Architecture, Department of Industrial Engineering, İstanbul Nişantaşı University , İstanbul, 34398, TürkiyeEnergy is an essential resource for sustaining daily life and achieving economic growth. The increase in global energy demand, combined with the adverse environmental impacts of fossil fuels, has highlighted the urgency of transitioning to sustainable energy sources. In large and heterogeneous countries like Türkiye, region-specific analyses of sustainable energy potential (SEP) are crucial for formulating effective policies and optimizing resource allocation. This study introduces a novel two-step hierarchical clustering and classification framework to evaluate the SEP of Türkiye’s provinces comprehensively. The framework combines fuzzy and crisp clustering methods to capture the complex relationships among socioeconomic, geographical, and renewable energy potential criteria. First, the Fuzzy C-Means (FCM) algorithm is employed to perform fuzzy clustering using three main criteria and three different distance metrics—Euclidean, Manhattan, and Minkowski—, resulting in 21 clustering scenarios. Membership degrees from the fuzzy clustering phase are then integrated into a new dataset, which undergoes crisp clustering using the K-Means algorithm. This approach provides both granular and definitive cluster structures, enabling a robust analysis of regional energy characteristics. To validate the clustering results, supervised classification methods—including K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost)—are utilized, alongside ensemble models based on RF and XGBoost. The classification results are compared with traditional clustering evaluation indices, such as the Silhouette and Calinski-Harabasz indices, demonstrating the feasibility of using classification models to assess clustering accuracy. This study’s key contributions lie in integrating clustering and classification methods systematically and providing actionable insights into which renewable energy sources are most suitable for each cluster. By tailoring policy recommendations to the unique characteristics of each cluster, this framework not only corroborates existing findings in the literature but also extends them by offering a practical methodology for regional energy planning.https://doi.org/10.1088/2515-7620/ada2e5sustainable energy potentialclassificationhierarchical clusteringdistance metricsclustering accuracy
spellingShingle Selen Avcı Azkeskin
Zerrin Aladağ
Evaluating regional sustainable energy potential through hierarchical clustering and machine learning
Environmental Research Communications
sustainable energy potential
classification
hierarchical clustering
distance metrics
clustering accuracy
title Evaluating regional sustainable energy potential through hierarchical clustering and machine learning
title_full Evaluating regional sustainable energy potential through hierarchical clustering and machine learning
title_fullStr Evaluating regional sustainable energy potential through hierarchical clustering and machine learning
title_full_unstemmed Evaluating regional sustainable energy potential through hierarchical clustering and machine learning
title_short Evaluating regional sustainable energy potential through hierarchical clustering and machine learning
title_sort evaluating regional sustainable energy potential through hierarchical clustering and machine learning
topic sustainable energy potential
classification
hierarchical clustering
distance metrics
clustering accuracy
url https://doi.org/10.1088/2515-7620/ada2e5
work_keys_str_mv AT selenavcıazkeskin evaluatingregionalsustainableenergypotentialthroughhierarchicalclusteringandmachinelearning
AT zerrinaladag evaluatingregionalsustainableenergypotentialthroughhierarchicalclusteringandmachinelearning