Self-Supervised Action Recognition
34 papers with code • 6 benchmarks • 5 datasets
Latest papers with no code
Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
There is a natural correlation between the visual and auditive elements of a video.
Learning and Using the Arrow of Time
We seek to understand the arrow of time in videos -- what makes videos look like they are playing forwards or backwards?
Self-Supervised Video Representation Learning With Odd-One-Out Networks
On action classification, our method obtains 60.3% on the UCF101 dataset using only UCF101 data for training, which is approximately 10% better than current state-of-the-art self-supervised learning methods.
Generating Videos with Scene Dynamics
We capitalize on large amounts of unlabeled video in order to learn a model of scene dynamics for both video recognition tasks (e.g., action classification) and video generation tasks (e.g., future prediction).
Shuffle and Learn: Unsupervised Learning using Temporal Order Verification
With this simple task and no semantic labels, we learn a powerful visual representation using a Convolutional Neural Network (CNN).
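
As a rough illustration of what a temporal order verification task can look like, the sketch below builds training pairs from frame indices alone: a tuple of frames is labeled 1 if it is in temporal order and 0 if it has been shuffled. This is a simplified sketch, not the authors' implementation. The function name `make_order_verification_pair`, the tuple length of 3, the 50/50 sampling ratio, and the choice to count only ascending order as positive are all assumptions made for the example.

```python
import random


def make_order_verification_pair(num_frames, tuple_len=3, rng=None):
    """Sample frame indices and a binary label for order verification.

    Hypothetical helper: returns (indices, label), where label is 1 if
    the indices are in ascending temporal order and 0 if they are shuffled.
    """
    if tuple_len < 2:
        raise ValueError("tuple_len must be at least 2")
    if num_frames < tuple_len:
        raise ValueError("video has fewer frames than tuple_len")
    rng = rng or random.Random()

    # Pick distinct frames and put them in their true temporal order.
    ordered = sorted(rng.sample(range(num_frames), tuple_len))

    # Assumed 50/50 split between positive and negative examples.
    if rng.random() < 0.5:
        return ordered, 1

    # Negative example: reshuffle until the order actually differs.
    shuffled = ordered[:]
    while shuffled == ordered:
        rng.shuffle(shuffled)
    return shuffled, 0


if __name__ == "__main__":
    indices, label = make_order_verification_pair(num_frames=30)
    print(indices, label)
```

The labels come for free from the video's own frame order, so no semantic annotation is needed; a CNN would then be trained to predict the label from the frames at those indices.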