Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach

As data-centric applications become increasingly complex, understanding effective query optimization in large-scale relational databases is crucial for managing this complexity. Yet, traditional cost-based and heuristic approaches simply do not scale, adapt, or remain accurate in highly dynamic mult...

Full description

Saved in:

Bibliographic Details
Main Authors:	Najla Sassi, Wassim Jaziri
Format:	Article
Language:	English
Published:	MDPI AG 2025-05-01
Series:	Mathematics
Subjects:	query optimization reinforcement learning graph neural networks join order selection large-scale databases resource efficiency
Online Access:	https://www.mdpi.com/2227-7390/13/11/1700
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849330984441348096
author	Najla Sassi Wassim Jaziri
author_facet	Najla Sassi Wassim Jaziri
author_sort	Najla Sassi
collection	DOAJ
description	As data-centric applications become increasingly complex, understanding effective query optimization in large-scale relational databases is crucial for managing this complexity. Yet, traditional cost-based and heuristic approaches simply do not scale, adapt, or remain accurate in highly dynamic multi-join queries. This research work proposes the reinforcement learning and graph-based hybrid query optimizer (GRQO), the first ever to apply reinforcement learning and graph theory for optimizing query execution plans, specifically in join order selection and cardinality estimation. By employing proximal policy optimization for adaptive policy learning and using graph-based schema representations for relational modeling, GRQO effectively traverses the combinatorial optimization space. Based on TPC-H (1 TB) and IMDB (500 GB) workloads, GRQO runs 25% faster in query execution time, scales 30% better, reduces CPU and memory use by 20–25%, and reduces the cardinality estimation error by 47% compared to traditional cost-based optimizers and machine learning-based optimizers. These findings highlight the ability of GRQO to optimize performance and resource efficiency in database management in cloud computing, data warehousing, and real-time analytics.
format	Article
id	doaj-art-c0ecd0c0096a48e2a4c0cc772dd02e23
institution	Kabale University
issn	2227-7390
language	English
publishDate	2025-05-01
publisher	MDPI AG
record_format	Article
series	Mathematics
spelling	doaj-art-c0ecd0c0096a48e2a4c0cc772dd02e232025-08-20T03:46:46ZengMDPI AGMathematics2227-73902025-05-011311170010.3390/math13111700Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based ApproachNajla Sassi0Wassim Jaziri1Department of Management Information Systems, School of Business, King Faisal University, Hofuf 31982, Saudi ArabiaDepartment of Management Information Systems, School of Business, King Faisal University, Hofuf 31982, Saudi ArabiaAs data-centric applications become increasingly complex, understanding effective query optimization in large-scale relational databases is crucial for managing this complexity. Yet, traditional cost-based and heuristic approaches simply do not scale, adapt, or remain accurate in highly dynamic multi-join queries. This research work proposes the reinforcement learning and graph-based hybrid query optimizer (GRQO), the first ever to apply reinforcement learning and graph theory for optimizing query execution plans, specifically in join order selection and cardinality estimation. By employing proximal policy optimization for adaptive policy learning and using graph-based schema representations for relational modeling, GRQO effectively traverses the combinatorial optimization space. Based on TPC-H (1 TB) and IMDB (500 GB) workloads, GRQO runs 25% faster in query execution time, scales 30% better, reduces CPU and memory use by 20–25%, and reduces the cardinality estimation error by 47% compared to traditional cost-based optimizers and machine learning-based optimizers. These findings highlight the ability of GRQO to optimize performance and resource efficiency in database management in cloud computing, data warehousing, and real-time analytics.https://www.mdpi.com/2227-7390/13/11/1700query optimizationreinforcement learninggraph neural networksjoin order selectionlarge-scale databasesresource efficiency
spellingShingle	Najla Sassi Wassim Jaziri Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach Mathematics query optimization reinforcement learning graph neural networks join order selection large-scale databases resource efficiency
title	Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach
title_full	Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach
title_fullStr	Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach
title_full_unstemmed	Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach
title_short	Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach
title_sort	efficient ai driven query optimization in large scale databases a reinforcement learning and graph based approach
topic	query optimization reinforcement learning graph neural networks join order selection large-scale databases resource efficiency
url	https://www.mdpi.com/2227-7390/13/11/1700
work_keys_str_mv	AT najlasassi efficientaidrivenqueryoptimizationinlargescaledatabasesareinforcementlearningandgraphbasedapproach AT wassimjaziri efficientaidrivenqueryoptimizationinlargescaledatabasesareinforcementlearningandgraphbasedapproach

Efficient AI-Driven Query Optimization in Large-Scale Databases: A Reinforcement Learning and Graph-Based Approach

Similar Items