Motion Feature Network: Fixed Motion Filter for Better Action Recognition

Spatio-temporal representations of frame sequences play an important role in the task of action recognition. Previously, using optical flow as temporal information in combination with a set of RGB images containing spatial information has shown great performance enhancement on action recognition tasks. However, optical flow has an expensive computational cost and requires a two-stream (RGB and optical flow) framework. In this paper, we propose MFNet (Motion Feature Network), containing motion blocks that encode spatio-temporal information between adjacent frames in a unified network that can be trained end-to-end. The motion block can be attached to any existing CNN-based action recognition framework at only a small additional cost. We evaluated our network on two action recognition datasets (Jester and Something-Something) and achieved competitive performance on both by training the networks from scratch.
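The core idea — fixed (non-learned) motion filters that approximate optical-flow-like cues from differences between adjacent frames — can be illustrated with a minimal sketch. This is an assumption-laden toy in NumPy, not the authors' implementation: the function name `motion_features` and the displacement set are illustrative, and a real motion block would operate on intermediate CNN feature maps.

```python
import numpy as np

# Fixed displacement set: one zero shift plus four unit shifts.
# (Illustrative choice; the paper's filters and shifts may differ.)
SHIFTS = [(0, 0), (0, 1), (0, -1), (1, 0), (-1, 0)]

def motion_features(prev, nxt, shifts=SHIFTS):
    """Stack directional difference maps between two adjacent frames.

    prev, nxt: 2-D arrays (H, W), feature maps at times t and t+1.
    Returns an array of shape (len(shifts), H, W): for each fixed
    displacement, the residual between the shifted next frame and the
    previous frame -- a cheap, flow-like motion cue with no extra
    learnable parameters.
    """
    maps = []
    for dy, dx in shifts:
        shifted = np.roll(np.roll(nxt, dy, axis=0), dx, axis=1)
        maps.append(shifted - prev)  # residual along this displacement
    return np.stack(maps)

# A static scene: the zero-shift channel is exactly zero (no motion).
frame = np.arange(16.0).reshape(4, 4)
feats = motion_features(frame, frame)
```

Because the filters are fixed, the block adds only shift-and-subtract operations on top of the backbone, which is why it can be bolted onto an existing CNN at small additional cost.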

ECCV 2018
Task                         | Dataset                      | Model              | Metric Name    | Metric Value | Global Rank
Action Recognition In Videos | Jester (Gesture Recognition) | MFNet              | Val            | 96.68        | #3
Action Recognition In Videos | Something-Something V1       | Motion Feature Net | Top 1 Accuracy | 43.9         | #2
Action Recognition           | Something-Something V1       | Motion Feature Net | Top 1 Accuracy | 43.9         | #69
