TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Action Classification	Kinetics-400	ip-CSN-152 (IG-65M pretraining)	Acc@1	82.5	# 69
Action Classification	Kinetics-400	ip-CSN-152 (IG-65M pretraining)	Acc@5	95.3	# 48
Action Classification	Kinetics-400	ir-CSN-152 (IG-65M pretraining)	Acc@1	82.6	# 68
Action Classification	Kinetics-400	R[2+1]D-152 (IG-65M pretraining)	Acc@1	81.3	# 78
Action Classification	Kinetics-400	R[2+1]D-152 (IG-65M pretraining)	Acc@5	95.1	# 53
Action Classification	Kinetics-400	ip-CSN-152 (Sports-1M pretraining)	Acc@1	79.2	# 108
Action Classification	Kinetics-400	ip-CSN-152 (Sports-1M pretraining)	Acc@5	93.8	# 83
Action Classification	Kinetics-400	ip-CSN-152	Acc@1	77.8	# 126
Action Classification	Kinetics-400	ip-CSN-152	Acc@5	92.8	# 102
Action Recognition	Something-Something V1	R(2+1)D-152 (IG-65M pretraining)	Top 1 Accuracy	51.6	# 47
Action Recognition	Something-Something V1	ir-CSN-101	Top 1 Accuracy	48.4	# 60
Action Recognition	Something-Something V1	ir-CSN-152	Top 1 Accuracy	49.3	# 57
Action Recognition	Something-Something V1	ir-CSN-152 (IG-65M pretraining)	Top 1 Accuracy	52.1	# 43
Action Recognition	Something-Something V1	ip-CSN-152 (IG-65M pretraining)	Top 1 Accuracy	53.3	# 37
Action Recognition	Sports-1M	ip-CSN-101 (RGB)	Video hit@1	74.9	# 2
Action Recognition	Sports-1M	ip-CSN-101 (RGB)	Video hit@5	92.6	# 2
Action Recognition	Sports-1M	ip-CSN-152 (RGB)	Video hit@1	75.5	# 1
Action Recognition	Sports-1M	ip-CSN-152 (RGB)	Video hit@5	92.8	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-classification-with-channel-separated/action-recognition-in-videos-on-sports-1m)](https://paperswithcode.com/sota/action-recognition-in-videos-on-sports-1m?p=video-classification-with-channel-separated)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-classification-with-channel-separated/action-recognition-in-videos-on-something-1)](https://paperswithcode.com/sota/action-recognition-in-videos-on-something-1?p=video-classification-with-channel-separated)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/video-classification-with-channel-separated/action-classification-on-kinetics-400)](https://paperswithcode.com/sota/action-classification-on-kinetics-400?p=video-classification-with-channel-separated)`

Video Classification with Channel-Separated Convolutional Networks

ICCV 2019 · Du Tran, Heng Wang, Lorenzo Torresani, Matt Feiszli ·

Group convolution has been shown to offer great computational savings in various 2D convolutional architectures for image classification. It is natural to ask: 1) if group convolution can help to alleviate the high computational cost of video classification networks; 2) what factors matter the most in 3D group convolutional networks; and 3) what are good computation/accuracy trade-offs with 3D group convolutional networks. This paper studies the effects of different design choices in 3D group convolutional networks for video classification. We empirically demonstrate that the amount of channel interactions plays an important role in the accuracy of 3D group convolutional networks. Our experiments suggest two main findings. First, it is a good practice to factorize 3D convolutions by separating channel interactions and spatiotemporal interactions as this leads to improved accuracy and lower computational cost. Second, 3D channel-separated convolutions provide a form of regularization, yielding lower training accuracy but higher test accuracy compared to 3D convolutions. These two empirical findings lead us to design an architecture -- Channel-Separated Convolutional Network (CSN) -- which is simple, efficient, yet accurate. On Sports1M, Kinetics, and Something-Something, our CSNs are comparable with or better than the state-of-the-art while being 2-3 times more efficient.

PDF Abstract ICCV 2019 PDF ICCV 2019 Abstract

Code

Add Remove Mark official

facebookresearch/VMZ official

1,033

open-mmlab/mmaction2

3,876

facebookresearch/R2Plus1D

1,033

BB-Repos/BBaction

salinasJJ/BBaction

See all 7 implementations

Tasks

Add Remove

Action Classification

Action Recognition

General Classification

Image Classification

Video Classification

Datasets

Kinetics

Kinetics 400

Sports-1M

Something-Something V1

Results from the Paper

Edit

Ranked #1 on Action Recognition on Sports-1M

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Action Classification	Kinetics-400	ip-CSN-152 (IG-65M pretraining)	Acc@1	82.5	# 69	Compare
Action Classification	Kinetics-400	ip-CSN-152 (IG-65M pretraining)	Acc@5	95.3	# 48	Compare
Action Classification	Kinetics-400	ir-CSN-152 (IG-65M pretraining)	Acc@1	82.6	# 68	Compare
Action Classification	Kinetics-400	R[2+1]D-152 (IG-65M pretraining)	Acc@1	81.3	# 78	Compare
Action Classification	Kinetics-400	R[2+1]D-152 (IG-65M pretraining)	Acc@5	95.1	# 53	Compare
Action Classification	Kinetics-400	ip-CSN-152 (Sports-1M pretraining)	Acc@1	79.2	# 108	Compare
Action Classification	Kinetics-400	ip-CSN-152 (Sports-1M pretraining)	Acc@5	93.8	# 83	Compare
Action Classification	Kinetics-400	ip-CSN-152	Acc@1	77.8	# 126	Compare
Action Classification	Kinetics-400	ip-CSN-152	Acc@5	92.8	# 102	Compare
Action Recognition	Something-Something V1	R(2+1)D-152 (IG-65M pretraining)	Top 1 Accuracy	51.6	# 47	Compare
Action Recognition	Something-Something V1	ir-CSN-101	Top 1 Accuracy	48.4	# 60	Compare
Action Recognition	Something-Something V1	ir-CSN-152	Top 1 Accuracy	49.3	# 57	Compare
Action Recognition	Something-Something V1	ir-CSN-152 (IG-65M pretraining)	Top 1 Accuracy	52.1	# 43	Compare
Action Recognition	Something-Something V1	ip-CSN-152 (IG-65M pretraining)	Top 1 Accuracy	53.3	# 37	Compare
Action Recognition	Sports-1M	ip-CSN-101 (RGB)	Video hit@1	74.9	# 2	Compare
Action Recognition	Sports-1M	ip-CSN-101 (RGB)	Video hit@5	92.6	# 2	Compare
Action Recognition	Sports-1M	ip-CSN-152 (RGB)	Video hit@1	75.5	# 1	Compare
Action Recognition	Sports-1M	ip-CSN-152 (RGB)	Video hit@5	92.8	# 1	Compare

Methods

Add Remove

Convolution

Edit Social Preview

Video Classification with Channel-Separated Convolutional Networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove