Search Results for author: Yurui Zhang

Found 1 papers, 1 papers with code

Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning

1 code implementation • 12 Jul 2023 • Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp

This makes us wonder if, based on visual cues, Vision-Language Models that are pre-trained with large-scale image-text resources can achieve and even outperform human's capability in reasoning times and location.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.