TVQA+ contains 310.8K bounding boxes, linking depicted objects to visual concepts in questions and answers.

Source: TVQA+: Spatio-Temporal Grounding for Video Question Answering

License


  • Unknown
