Search Results for author: Shigeki Saito

Found 1 papers, 0 papers with code

Evaluating Image Review Ability of Vision Language Models

no code implementations19 Feb 2024 Shigeki Saito, Kazuki Hayashi, Yusuke Ide, Yusuke Sakai, Kazuma Onishi, Toma Suzuki, Seiji Gobara, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Large-scale vision language models (LVLMs) are language models that are capable of processing images and text inputs by a single model.

Image Captioning

Cannot find the paper you are looking for? You can Submit a new open access paper.