1 dataset result for Video Captioning AND Time series

We provide a dataset called MMAC Captions for sensor-augmented egocentric-video captioning. The dataset contains 5,002 activity descriptions by extending the CMU-MMAC dataset. A number of activity description examples can be found in the homepage.

2 PAPERS • NO BENCHMARKS YET

Datasets

1 dataset result for Video Captioning AND Time series