A Hybrid CBIR Framework Using Vision Transformers and Genetic Algorithm for Enhanced Image Retrieval

P Deekshita; Vandana Bonu; Areman Ramyasri; Vaddadi Rani; Keerthana Bodduru; Nagarjuna  Karyemsetty

doi:10.38094/jastt62304

Vol. 6 No. 2 (2025)

Standard Journal Issues

A Hybrid CBIR Framework Using Vision Transformers and Genetic Algorithm for Enhanced Image Retrieval

Published 2025-09-30

P. Deekshita
Vandana Bonu
Areman Ramyasri
Vaddadi Vasudha Rani
Bodduru Keerthana
Nagarjuna Karyemsetty

P. Deekshita
Department of Artificial Intelligence and Data Science, Vignan's Institute of Information Technology(A), Visakhapatnam, Andhra Pradesh, India

Vandana Bonu
Department of Information Technology, GMR Institute of Technology(A), Rajam, ?Andhra Pradesh, India

Areman Ramyasri
Department of Humanities and Management, G. Narayanamma Institute of technology and Sciences (for women), Hyderabad, Telangana, India

Vaddadi Vasudha Rani
Department of Information Technology, GMR Institute of Technology(A), Rajam, ?Andhra Pradesh, India

Bodduru Keerthana
Department of Information Technology, Anil Neerukonda Institute of Technology and Sciences(A), Sangivalasa, Visakhapatnam, Andhra Pradesh, India

Nagarjuna Karyemsetty
Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram, Guntur, Andhra Pradesh, India

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

How to Cite

[1]

P. Deekshita, V. Bonu, A. Ramyasri, V. Rani, K. Bodduru, and N. . Karyemsetty, “A Hybrid CBIR Framework Using Vision Transformers and Genetic Algorithm for Enhanced Image Retrieval”, JASTT, vol. 6, no. 2, pp. 277–289, Sep. 2025, doi: 10.38094/jastt62304.

Download Citation

Abstract

Content-Based Image Retrieval (CBIR) is an essential tool for arranging and acquiring visual content from large-scale image databases. This research presents a robust hybrid CBIR structure that combines transformer-based deep feature extraction with Genetic Algorithm (GA) optimization to significantly improve retrieval accuracy and efficiency. The proposed system introduces Vision Transformers (ViT) to efficiently capture intricate, global visual figures over the distinctive image categories, supporting both single and multi-object image retrieval scenarios. By influencing the long-range dependency modelling abilities of transformers, the system extracts highly different feature representations. These elements are further optimized with the help of Genetic Algorithm, a powerful adaptive technique that efficiently enhances feature selection and matching through iterative selection, crossover, and mutation processes. Comprehensive experiments were performed on the Corel 1K benchmark dataset illustrates the proposed hybrid model surpasses conventional CBIR model in terms of precision, recall, accuracy, and F1-score. The system achieves a retrieval accuracy of 99.38%, an F1-score of 95.12%, and a reduced error rate of 0.62%, showcasing its superior retrieval performance and computational efficiency. The results highlight the potential of integrating transformer-based deep learning with evolutionary optimization in advancing modern CBIR systems.

Keywords

Genetic Algorithm, Vision Transformer, Feature extraction, Corel 1k, Optimization, Deep Learning

PDF

References

S. Yenigalla, K. S. Rao, and P. Ngangbam, “Implementation of Content-Based Image Retrieval Using Artificial Neural Networks,” International Conference on “Holography Meets Advanced Manufacturing”), p. 25, Mar. 2023, doi: 10.3390/hmam2-14161.
M. H. Hadid, Q. M. Hussein, Z. T. Al-Qaysi, M. A. Ahmed, and M. M. Salih, “An Overview of Content-Based Image Retrieval Methods and Techniques,” Iraqi Journal for Computer Science and Mathematics, pp. 66–78, Jul. 2023, doi: 10.52866/ijcsm.2023.02.03.006.
S. Sikandar, R. Mahum, and A. Alsalman, “A novel hybrid approach for a Content-Based image retrieval using feature fusion,” Applied Sciences, vol. 13, no. 7, p. 4581, Apr. 2023, doi: 10.3390/app13074581.
N. Arora, A. Kakde, and S. C. Sharma, “An optimal approach for content-based image retrieval using deep learning on COVID-19 and pneumonia X-ray Images,” International Journal of Systems Assurance Engineering and Management, vol. 14, no. S1, pp. 246–255, Dec. 2022, doi: 10.1007/s13198-022-01846-4.
A. Mahbod, N. Saeidi, S. Hatamikia, and R. Woitek, “Evaluating pre-trained convolutional neural networks and foundation models as feature extractors for content-based medical image retrieval,” Engineering Applications of Artificial Intelligence, vol. 150, p. 110571, Mar. 2025, doi: 10.1016/j.engappai.2025.110571.
S. Fadaei, M. Azadimotlagh, A. Rashno, and A. Beheshti, “A new texture descriptor based on hexagonal local binary pattern for Content-Based image retrieval,” Digital Signal Processing, p. 105138, Mar. 2025, doi: 10.1016/j.dsp.2025.105138.
F. Shaheen and R. L. Raibagkar, “Efficient Content-Based Image Retrieval System with Two-Tier Hybrid Frameworks,” Applied Computer Systems, vol. 27, no. 2, pp. 166–182, Dec. 2022, doi: 10.2478/acss-2022-0018.
B. Duriqi, H. Snopçe, A. Salihu, A. Luma, and M. Fetaji, “Enhanced algorithm based on Chio-like Method for Non-Square Determinant Calculations for application in CBVR,” Journal of Applied Science and Technology Trends, vol. 6, no. 2, pp. 149–160, Aug. 2025, doi: 10.38094/jastt62253.
A. S. Ahmed and I. N. Ibraheem, “Recent advances in content based image retrieval using deep learning techniques: A survey,” AIP Conference Proceedings, vol. 3219, p. 030003, Jan. 2024, doi: 10.1063/5.0236594.
N. Hasan, Y. Bao, A. Shawon, and Y. Huang, “DenseNet Convolutional Neural Networks Application for predicting COVID-19 using CT Image,” SN Computer Science, vol. 2, no. 5, Jul. 2021, doi: 10.1007/s42979-021-00782-7.
Y. Huo, K. Jin, J. Cai, H. Xiong, and J. Pang, “Vision Transformer (ViT)-based Applications in Image Classification,” 2023 IEEE 9th Intl Conference on Big Data Security on Cloud (BigDataSecurity), IEEE Intl Conference on High Performance and Smart Computing, (HPSC) and IEEE Intl Conference on Intelligent Data and Security (IDS), pp. 135–140, May 2023, doi: 10.1109/bigdatasecurity-hpsc-ids58521.2023.00033.
Ch. S. Kameswari et al., “An Overview of vision Transformers for image Processing: a survey,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 8, Jan. 2023, doi: 10.14569/ijacsa.2023.0140830
S. Lee, J. Kim, H. Kang, D.-Y. Kang, and J. Park, “Genetic algorithm based deep learning neural network structure and hyperparameter optimization,” Applied Sciences, vol. 11, no. 2, p. 744, Jan. 2021, doi: 10.3390/app11020744.
J. Li, R. Dong, X. Wu, W. Huang, and P. Lin, “A Self-Learning Hyper-Heuristic Algorithm based on a genetic algorithm: a case study on prefabricated modular cabin unit logistics scheduling in a cruise ship manufacturer,” Biomimetics, vol. 9, no. 9, p. 516, Aug. 2024, doi: 10.3390/biomimetics9090516.
S. A. Alex, J. J. V. Nayahi, and S. Kaddoura, “Deep convolutional neural networks with genetic algorithm-based synthetic minority over-sampling technique for improved imbalanced data classification,” Applied Soft Computing, vol. 156, p. 111491, Mar. 2024, doi: 10.1016/j.asoc.2024.111491.
O. I. Obaid, M. Mohammed, A. O. Salman, S. A. Mostafa, and A. Elngar, “Comparing the performance of pre-trained deep learning models in object detection and recognition,” Journal of Information Technology Management, 14, 4, pp. 40-56, 2022, doi: 10.22059/jitm.2022.88134.
A. A. Ojugo and O. Nwankwo, “Spectral-Cluster solution for Credit-Card fraud detection using a genetic algorithm trained modular deep learning neural network,” JINAV Journal of Information and Visualization, vol. 2, no. 1, pp. 15–24, Jan. 2021, doi: 10.35877/454ri.jinav274.
T. S. Prajwal and I. A. K, “A Comparative Study Of RESNET-Pretrained Models For Computer Vision,” IC3-2023: Proceedings of the 2023 Fifteenth International Conference on Contemporary Computing, pp. 419–425, Aug. 2023, doi: 10.1145/3607947.3608042.
F. Radenovic, A. Iscen, G. Tolias, Y. Avrithis and O. Chum, "Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking," 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 2018, pp. 5706-5715, doi: 10.1109/CVPR.2018.00598.
Y. Li, “The Investigation of DeiT model Based on PaddlePaddle Framework on CIFAR-10 Dataset Image Classification,” in Advances in computer science research, 2023, pp. 1062–1067. doi: 10.2991/978-94-6463-300-9_106.
R. Khosrowshahli, F. Kheiri, A. A. Bidgoli, H. R. Tizhoosh, M. Makrehchi, and S. Rahnamayan, “Enhancing image retrieval through optimal barcode representation,” Scientific Reports, vol. 15, no. 1, Aug. 2025, doi: 10.1038/s41598-025-14576-x.
M. Nallappan and R. Velswamy, “Exploring deep learning-based content-based video retrieval with Hierarchical Navigable Small World index and ResNet-50 features for anomaly detection,” Expert Systems With Applications, vol. 247, p. 123197, Jan. 2024, doi: 10.1016/j.eswa.2024.123197.
M. S. Rao, “Hybrid Deep Learning Approach for Marine Debris Detection in Satellite Imagery Using UNet with ResNext50 Backbone,” Journal of Applied Science and Technology Trends, vol. 6, no. 1, pp. 50–60, Jun. 2025, doi: 10.38094/jastt61243.
V. S. Anagani, A. Rani, P. Panuganti, and M. Tharangini, “Pancreatic Cancer Detection Using Quaternion Wavelet Transform and Squeeze-and-Excitation Network with SVM Classifier,” Journal of Applied Science and Technology Trends, vol. 6, no. 2, pp. 194–202, Aug. 2025, doi: 10.38094/jastt62269.
G. Gautam and A. Khanna, “Content Based Image Retrieval System Using CNN based Deep Learning Models,” Procedia Computer Science, vol. 235, pp. 3131–3141, Jan. 2024, doi: 10.1016/j.procs.2024.04.296.
Z. Chao, S. Cheng, and Y. Li, “Deep internally connected transformer hashing for image retrieval,” Knowledge-Based Systems, vol. 279, p. 110953, Sep. 2023, doi: 10.1016/j.knosys.2023.110953.
A. P and G. R, “Optimizing visual data retrieval using deep learning driven CBIR for improved human machine interaction,” Scientific Reports, vol. 15, no. 1, Jul. 2025, doi: 10.1038/s41598-025-05478-z.
R. Kumar and N. M. M. S, “Enhancing Content-based Image Retrieval Performance through Optimized Feature Selection,” Engineering Technology & Applied Science Research, vol. 15, no. 3, pp. 23783–23789, Jun. 2025, doi: 10.48084/etasr.10974.

Downloads

Download data is not yet available.

Abstract

Keywords

References

Downloads

Similar Articles