Reinforcement Learning for Join Order Optimization in PostgreSQL: Query Rewriting and Evaluation on JOB and TPC-H Benchmarks

Abstract

Join order optimization is a critical combinatorial problem in query processing. This paper applies reinforcement learning (RL) techniques to the join order optimization task in PostgreSQL by implementing and evaluating three separate RL-based optimizers: Proximal Policy Optimization (PPO), Deep Q-Network (DQN), and Advantage Actor-Critic (A2C). Each method is trained on the Join Order Benchmark (JOB) and evaluated on both JOB and TPC-H workloads. Performance is compared against PostgreSQL’s default planner without join reordering (PG-Old) and the built-in PostgreSQL optimizer with reordering enabled (PG-Join). Results show that RL-based methods significantly reduce execution times compared to PG-Old and often perform on par with or better than PG-Join, especially for complex multi-join queries. Among the tested methods, PPO achieves the most consistent improvements, with up to 4.47× average speedup over PG-Old on JOB and measurable gains on TPC-H. These findings demonstrate the potential of reinforcement learning as a practical and adaptive approach to join order optimization in relational databases.
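The join-ordering task described above can be framed as a sequential decision problem: the state is the set of relations not yet joined, an action picks the next relation, and the reward penalizes the cost of the resulting intermediate result. A minimal sketch of such an environment, assuming a toy uniform-selectivity cardinality model (the paper's agents are instead trained against PostgreSQL's actual plan and execution costs; all names here are illustrative):

```python
class JoinOrderEnv:
    """Toy join-order MDP.

    State: tuple of relation indices not yet joined.
    Action: position of the next relation to join (index into `remaining`).
    Reward: negative incremental join cost under a uniform-selectivity model.
    """

    def __init__(self, cardinalities, selectivity=0.1):
        self.cards = list(cardinalities)   # per-relation row counts
        self.sel = selectivity             # uniform join selectivity (assumption)
        self.reset()

    def reset(self):
        self.remaining = list(range(len(self.cards)))
        self.inter_card = None             # rows in the current intermediate result
        self.total_cost = 0.0
        return tuple(self.remaining)

    def step(self, action):
        rel = self.remaining.pop(action)
        if self.inter_card is None:        # first relation: nothing to join yet
            self.inter_card = self.cards[rel]
            cost = 0.0
        else:                              # cost ~ size of the produced intermediate
            cost = self.inter_card * self.cards[rel] * self.sel
            self.inter_card = cost
        self.total_cost += cost
        done = not self.remaining
        return tuple(self.remaining), -cost, done


def greedy_order(env):
    """Always join the smallest remaining relation next.

    A hand-coded stand-in for a learned policy (PPO/DQN/A2C would instead
    select actions from a trained network).
    """
    env.reset()
    order, done = [], False
    while not done:
        best = min(range(len(env.remaining)),
                   key=lambda a: env.cards[env.remaining[a]])
        order.append(env.remaining[best])
        _, _, done = env.step(best)
    return order, env.total_cost
```

On three relations with 1000, 10, and 100 rows, the greedy policy joins them smallest-first, which keeps intermediate results small; an RL agent is trained to discover such orderings (and better ones) from reward alone.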

Keywords

Reinforcement Learning, Query Optimization, Join Order, TPC-H, PostgreSQL, Join Order Benchmark

