Ranjan Biswal, M., & Baliarsingh, S. K. (2026). Integrating Vision and Language: An Improved VAD Model. Journal of Applied Science and Technology Trends, 7(1), 137-156. https://doi.org/10.38094/jastt71658