Ranjan Biswal, Manas, and Santos Kumar Baliarsingh. “Integrating Vision and Language: An Improved VAD Model”. Journal of Applied Science and Technology Trends, vol. 7, no. 1, Mar. 2026, pp. 137-56, https://doi.org/10.38094/jastt71658.