Search Results for author: Kirolos Ataallah

MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens

This paper introduces MiniGPT4-Video, a multimodal Large Language Model (LLM) designed specifically for video understanding.

370

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.