The Flickr30K Entities dataset is an extension to the Flickr30K dataset. It augments the original 158k captions with 244k coreference chains, linking mentions of the same entities across different captions for the same image, and associating them with 276k manually annotated bounding boxes. This is used to define a new benchmark for localization of textual entity mentions in an image.

Source: http://bryanplummer.com/Flickr30kEntities/

Papers


Paper Code Results Date Stars

Tasks


Similar Datasets


Modalities


Languages