5 papers with code

Benchmarks

You can find evaluation results in the subtasks. You can also submit evaluation metrics for this task.

Greatest papers with code

A Multigrid Method for Efficiently Training Video Models

CVPR 2020 facebookresearch/SlowFast

We empirically demonstrate a general and robust grid schedule that yields a significant out-of-the-box training speedup without a loss in accuracy for different models (I3D, non-local, SlowFast), datasets (Kinetics, Something-Something, Charades), and training settings (with and without pre-training, 128 GPUs or 1 GPU).

ACTION DETECTION, ACTION RECOGNITION, VIDEO UNDERSTANDING

SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

7 Mar 2020 TiagoCortinhal/SalsaNext

In this paper, we introduce SalsaNext for the uncertainty-aware semantic segmentation of a full 3D LiDAR point cloud in real-time.

3D SEMANTIC SEGMENTATION, AUTONOMOUS DRIVING

AViD Dataset: Anonymized Videos from Diverse Countries

10 Jul 2020 piergiaj/AViD

We confirm that most of the existing video datasets are statistically biased to only capture action videos from a limited number of countries.

 Ranked #1 on Action Detection on Charades (using extra training data)

ACTION CLASSIFICATION, ACTION DETECTION, ACTION RECOGNITION

Multi-branch Attentive Transformer

18 Jun 2020 HA-Transformer/HA-Transformer

While the multi-branch architecture is one of the key ingredients to the success of computer vision tasks, it has not been well investigated in natural language processing, especially sequence learning tasks.

CODE GENERATION, MACHINE TRANSLATION, NATURAL LANGUAGE UNDERSTANDING