VOT2016 is a video dataset for visual object tracking. It contains 60 video clips and 21,646 corresponding ground truth maps with pixel-wise annotation of salient objects.
Source: Video Saliency Detection by 3D Convolutional Neural NetworksPaper | Code | Results | Date | Stars |
---|