TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Detection	J-HMDB	MR-TS R-CNN	Video-mAP 0.2	74.3	# 9
Action Detection	J-HMDB	MR-TS R-CNN	Video-mAP 0.5	73.09	# 12
Action Detection	J-HMDB	MR-TS R-CNN	Frame-mAP 0.5	58.5	# 9
Skeleton Based Action Recognition	J-HMDB	MR Two-Stream R-CNN	Accuracy (RGB+pose)	71.1	# 7
Action Detection	J-HMDB	TS R-CNN	Video-mAP 0.2	71.1	# 11
Action Detection	J-HMDB	TS R-CNN	Video-mAP 0.5	70.6	# 13
Action Detection	J-HMDB	TS R-CNN	Frame-mAP 0.5	56.9	# 10
Action Recognition	UCF101	MR Two-Stream R-CNN	3-fold Accuracy	91.1	# 68
Action Detection	UCF101-24	TS R-CNN	Frame-mAP 0.5	39.94	# 11
Action Detection	UCF101-24	MR-TS R-CNN	Frame-mAP 0.5	39.63	# 12
Action Detection	UCF Sports	TS R-CNN	Video-mAP 0.2	94.82	# 2
Action Detection	UCF Sports	TS R-CNN	Video-mAP 0.5	94.82	# 2
Action Detection	UCF Sports	TS R-CNN	Frame-mAP 0.5	82.30	# 3
Action Detection	UCF Sports	MR-TS R-CNN	Video-mAP 0.2	94.83	# 1
Action Detection	UCF Sports	MR-TS R-CNN	Video-mAP 0.5	94.67	# 3
Action Detection	UCF Sports	MR-TS R-CNN	Frame-mAP 0.5	84.52	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-region-two-stream-r-cnn-for-action/action-detection-on-ucf-sports)](https://paperswithcode.com/sota/action-detection-on-ucf-sports?p=multi-region-two-stream-r-cnn-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-region-two-stream-r-cnn-for-action/skeleton-based-action-recognition-on-j-hmdb)](https://paperswithcode.com/sota/skeleton-based-action-recognition-on-j-hmdb?p=multi-region-two-stream-r-cnn-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-region-two-stream-r-cnn-for-action/action-detection-on-j-hmdb)](https://paperswithcode.com/sota/action-detection-on-j-hmdb?p=multi-region-two-stream-r-cnn-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-region-two-stream-r-cnn-for-action/action-detection-on-ucf101-24)](https://paperswithcode.com/sota/action-detection-on-ucf101-24?p=multi-region-two-stream-r-cnn-for-action)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multi-region-two-stream-r-cnn-for-action/action-recognition-in-videos-on-ucf101)](https://paperswithcode.com/sota/action-recognition-in-videos-on-ucf101?p=multi-region-two-stream-r-cnn-for-action)`

Multi-region two-stream R-CNN for action detection

European Conference on Computer Vision (ECCV 2016) · Xiaojiang Peng, Cordelia Schmid

We propose a multi-region two-stream R-CNN model for action detection in realistic videos. We start from frame-level action detection based on Faster R-CNN [1], and make three contributions: (1) we show that a motion region proposal network generates high-quality proposals, which are complementary to those of an appearance region proposal network; (2) we show that stacking optical flow over several frames significantly improves frame-level action detection; and (3) we embed a multi-region scheme in the Faster R-CNN model, which adds complementary information on body parts. We then link frame-level detections with the Viterbi algorithm, and temporally localize an action with the maximum subarray method. Experimental results on the UCF-Sports, J-HMDB and UCF101 action detection datasets show that our approach outperforms the state of the art by a significant margin in both frame-mAP and video-mAP.
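The abstract names two post-processing steps, Viterbi linking and maximum-subarray temporal localization, without giving their details. The following is a minimal sketch of how such steps are commonly built, not the authors' implementation. The pairwise IoU term, the `overlap_weight` parameter, the `offset` subtracted from per-frame scores, and all function names are assumptions made for illustration.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)
Detection = Tuple[Box, float]            # (box, class score)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def link_detections(frames: List[List[Detection]],
                    overlap_weight: float = 1.0) -> List[int]:
    """Viterbi linking: choose one detection per frame so that the sum of
    detection scores plus weighted IoU between consecutive boxes is maximal.
    Returns the index of the chosen detection in each frame."""
    best = [score for _, score in frames[0]]  # best path score ending at each box
    back = []                                 # back-pointers, one list per transition
    for prev, cur in zip(frames, frames[1:]):
        new_best, pointers = [], []
        for box, score in cur:
            cands = [best[j] + overlap_weight * iou(prev_box, box)
                     for j, (prev_box, _) in enumerate(prev)]
            j_star = max(range(len(cands)), key=cands.__getitem__)
            new_best.append(cands[j_star] + score)
            pointers.append(j_star)
        best = new_best
        back.append(pointers)
    # Backtrack from the best final detection.
    idx = max(range(len(best)), key=best.__getitem__)
    path = [idx]
    for pointers in reversed(back):
        idx = pointers[idx]
        path.append(idx)
    return path[::-1]


def best_segment(scores: List[float], offset: float = 0.5) -> Tuple[int, int]:
    """Maximum subarray (Kadane's algorithm) over `scores - offset`.
    Returns the inclusive (start, end) frame range with the largest sum."""
    best_sum = float("-inf")
    best_range = (0, 0)
    run_sum, run_start = 0.0, 0
    for t, s in enumerate(scores):
        v = s - offset
        if run_sum <= 0:          # a non-positive prefix never helps; restart here
            run_sum, run_start = v, t
        else:
            run_sum += v
        if run_sum > best_sum:
            best_sum, best_range = run_sum, (run_start, t)
    return best_range
```

In this sketch, `link_detections` would be run per action class to build a track, and `best_segment` would then be applied to the scores along that track. Subtracting `offset` makes low-scoring frames contribute negatively, so the maximum subarray trims them from either end of the track.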

Code

No code implementations yet.

Tasks

Action Detection

Action Recognition

Region Proposal

Skeleton Based Action Recognition

Datasets

UCF101

J-HMDB

UCF101-24

UCF Sports

Results from the Paper

Ranked #2 on Action Detection on UCF Sports

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Skeleton Based Action Recognition	J-HMDB	MR Two-Stream R-CNN	Accuracy (RGB+pose)	71.1	# 7
Action Recognition	UCF101	MR Two-Stream R-CNN	3-fold Accuracy	91.1	# 68
Action Detection	UCF101-24	TS R-CNN	Frame-mAP 0.5	39.94	# 11
Action Detection	UCF101-24	MR-TS R-CNN	Frame-mAP 0.5	39.63	# 12
Action Detection	UCF Sports	TS R-CNN	Video-mAP 0.2	94.82	# 2
Action Detection	UCF Sports	TS R-CNN	Video-mAP 0.5	94.82	# 2
Action Detection	UCF Sports	TS R-CNN	Frame-mAP 0.5	82.30	# 3
Action Detection	UCF Sports	MR-TS R-CNN	Video-mAP 0.2	94.83	# 1
Action Detection	UCF Sports	MR-TS R-CNN	Video-mAP 0.5	94.67	# 3
Action Detection	UCF Sports	MR-TS R-CNN	Frame-mAP 0.5	84.52	# 2

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank
Action Detection	J-HMDB	MR-TS R-CNN	Video-mAP 0.2	74.3	# 9
Action Detection	J-HMDB	MR-TS R-CNN	Video-mAP 0.5	73.09	# 12
Action Detection	J-HMDB	MR-TS R-CNN	Frame-mAP 0.5	58.5	# 9
Action Detection	J-HMDB	TS R-CNN	Video-mAP 0.2	71.1	# 11
Action Detection	J-HMDB	TS R-CNN	Video-mAP 0.5	70.6	# 13
Action Detection	J-HMDB	TS R-CNN	Frame-mAP 0.5	56.9	# 10

Methods

Convolution • Faster R-CNN • RoIPool • RPN • Softmax
