ELINJULLIPARAMBIL, Sajud Hamza. Generalist Vision Models for Any-to-Any Image-to-Video Understanding. International Journal of Emerging Trends in Computer Science and Information Technology, [S. l.], v. 6, n. 3, p. 112–120, 2025. DOI: 10.63282/3050-9246.IJETCSIT-V6I3P117. Available at: https://ijetcsit.org/index.php/ijetcsit/article/view/528. Accessed: 30 Jul. 2026.