Elinjulliparambil, Sajud Hamza. “Generalist Vision Models for Any-to-Any Image-to-Video Understanding.” International Journal of Emerging Trends in Computer Science and Information Technology 6, no. 3 (August 24, 2025): 112–120. Accessed January 29, 2026. https://ijetcsit.org/index.php/ijetcsit/article/view/528.