Elinjulliparambil, Sajud Hamza. “Generalist Vision Models for Any-to-Any Image-to-Video Understanding.” International Journal of Emerging Trends in Computer Science and Information Technology, vol. 6, no. 3, Aug. 2025, pp. 112-20, https://doi.org/10.63282/3050-9246.IJETCSIT-V6I3P117.