Video Recognition

22 papers with code · Computer Vision
Subtask of Video

Benchmarks

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Latest papers without code

Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

6 Aug 2020

In this paper, we consider a task of stopping the video stream recognition process of a text field, in which each frame is recognized independently and the individual results are combined together.

VIDEO RECOGNITION

MOMS with Events: Multi-Object Motion Segmentation With Monocular Event Cameras

11 Jun 2020

Segmentation of moving objects in dynamic scenes is a key process in scene understanding for both navigation and video recognition tasks.

MOTION SEGMENTATION SCENE UNDERSTANDING VIDEO RECOGNITION

X3D: Expanding Architectures for Efficient Video Recognition

CVPR 2020

This paper presents X3D, a family of efficient video networks that progressively expand a tiny 2D image classification architecture along multiple network axes, in space, time, width and depth.

FEATURE SELECTION IMAGE CLASSIFICATION VIDEO CLASSIFICATION VIDEO RECOGNITION

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

CVPR 2020

It is beneficial to incorporate more discriminative features to improve generalization on tail classes.

FEW-SHOT LEARNING VIDEO RECOGNITION

Clean-Label Backdoor Attacks on Video Recognition Models

CVPR 2020

We propose the use of a universal adversarial trigger as the backdoor trigger to attack video recognition models, a situation where backdoor attacks are likely to be challenged by the above 4 strict conditions.

IMAGE CLASSIFICATION VIDEO RECOGNITION

TAM: Temporal Adaptive Module for Video Recognition

14 May 2020

Temporal modeling is crucial for capturing spatiotemporal structure in videos for action recognition.

ACTION RECOGNITION VIDEO RECOGNITION

Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

12 May 2020

Inspired by such capability of humans, to imitate humans' ability of learning visual primitives and composing primitives to recognize novel classes, we propose an approach to FSL to learn a feature representation composed of important primitives, which is jointly trained with two parts, i.e., primitive discovery and primitive enhancing.

FEW-SHOT IMAGE CLASSIFICATION VIDEO RECOGNITION
