Verse is a new dataset that augments existing multimodal datasets (COCO and TUHOI) with sense labels.
4 PAPERS • NO BENCHMARKS YET