The Full-Sentence Visual Question Answering (FSVQA) dataset consists of nearly 1 million pairs of questions and full-sentence answers for images. It was built by applying a number of rule-based natural language processing techniques to the original VQA dataset and the captions in the MS COCO dataset.
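To make the idea of a rule-based conversion concrete, here is a minimal sketch of how one question and short-answer pair could be turned into a full-sentence answer. The single regular-expression rule and the function name `to_full_sentence` are hypothetical and chosen only for illustration; they are not the rules used to build FSVQA.

```python
import re

# Hypothetical rule for illustration only; not the dataset's actual rule set.
COLOR_QUESTION = re.compile(r"^what color is (?P<subject>.+?)\?$", re.IGNORECASE)


def to_full_sentence(question: str, short_answer: str) -> str:
    """Turn a question and a short answer into a full-sentence answer."""
    match = COLOR_QUESTION.match(question.strip())
    if match is None:
        # No rule applies, so fall back to the short answer unchanged.
        return short_answer
    subject = match.group("subject")
    return f"The color of {subject} is {short_answer}."


print(to_full_sentence("What color is the cat?", "gray"))
```

A real pipeline would presumably need many such rules covering different question types; this sketch handles only one pattern and leaves all other questions untouched.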
Source: The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA)