Search Results for author: Shigeki Saito

Found 1 paper, 0 papers with code

Evaluating Image Review Ability of Vision Language Models

no code implementations • 19 Feb 2024 • Shigeki Saito, Kazuki Hayashi, Yusuke Ide, Yusuke Sakai, Kazuma Onishi, Toma Suzuki, Seiji Gobara, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Large-scale vision language models (LVLMs) are language models that can process both image and text inputs within a single model.

Image Captioning
