TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Online Action Detection	FineAction	MiniROAD	mAP	37.1	# 1
Online Action Detection	THUMOS'14	MiniROAD	mAP	71.8	# 2
Online Action Detection	THUMOS'14	MiniROAD	MFLOPs per pred	15.8	# 5
Online Action Detection	TVSeries	MiniROAD	mCAP	89.6	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/miniroad-minimal-rnn-framework-for-online/online-action-detection-on-fineaction)](https://paperswithcode.com/sota/online-action-detection-on-fineaction?p=miniroad-minimal-rnn-framework-for-online)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/miniroad-minimal-rnn-framework-for-online/online-action-detection-on-tvseries)](https://paperswithcode.com/sota/online-action-detection-on-tvseries?p=miniroad-minimal-rnn-framework-for-online)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/miniroad-minimal-rnn-framework-for-online/online-action-detection-on-thumos-14)](https://paperswithcode.com/sota/online-action-detection-on-thumos-14?p=miniroad-minimal-rnn-framework-for-online)`

MiniROAD: Minimal RNN Framework for Online Action Detection

ICCV 2023 · Joungbin An, Hyolim Kang, Su Ho Han, Ming-Hsuan Yang, Seon Joo Kim

Online Action Detection (OAD) is the task of identifying actions in streaming videos without access to future frames. Much effort has been devoted to effectively capturing long-range dependencies, with transformers receiving the spotlight for their ability to capture long-range temporal structures. In contrast, RNNs have received less attention lately, due to their lower performance compared to recent methods that utilize transformers. In this paper, we investigate the underlying reasons for the inferior performance of RNNs compared to transformer-based algorithms. Our findings indicate that the discrepancy between training and inference is the primary hindrance to the effective training of RNNs. To address this, we propose applying non-uniform weights to the loss computed at each time step, which allows the RNN model to learn from the predictions made in an environment that better resembles the inference stage. Extensive experiments on three benchmark datasets (THUMOS, TVSeries, and FineAction) demonstrate that a minimal RNN-based model trained with the proposed methodology performs as well as or better than the existing best methods with a significant increase in efficiency. The code is available at https://github.com/jbistanbul/MiniROAD.
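The abstract's central idea, weighting the per-time-step loss non-uniformly, can be illustrated with a minimal sketch. The abstract does not give the actual weighting scheme, so the linear ramp below (later steps count more) is an assumption chosen only for illustration, and the names `step_weights` and `weighted_sequence_loss` are hypothetical. This is not the authors' implementation, which is in the linked repository.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def step_weights(num_steps, start=0.0, end=1.0):
    """Hypothetical linear ramp of loss weights over time steps.

    Assumption: later steps, where the RNN has seen more context, get
    larger weights. The paper's actual scheme may differ.
    """
    if num_steps < 1:
        raise ValueError("num_steps must be at least 1")
    if num_steps == 1:
        return [end]
    span = end - start
    return [start + span * t / (num_steps - 1) for t in range(num_steps)]


def weighted_sequence_loss(logits_per_step, labels, weights):
    """Weighted average of per-step cross-entropy losses.

    logits_per_step: one list of class logits per time step.
    labels: ground-truth class index per time step.
    weights: non-negative loss weight per time step.
    """
    if not (len(logits_per_step) == len(labels) == len(weights)):
        raise ValueError("inputs must have the same number of time steps")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    loss = 0.0
    for logits, label, w in zip(logits_per_step, labels, weights):
        probs = softmax(logits)
        loss += w * -math.log(probs[label])
    return loss / total_weight


if __name__ == "__main__":
    # Toy 3-step sequence with 2 classes; the true class is 0 at every step.
    logits = [[2.0, 0.0], [0.0, 2.0], [0.0, 0.0]]
    labels = [0, 0, 0]
    uniform = weighted_sequence_loss(logits, labels, [1.0, 1.0, 1.0])
    ramped = weighted_sequence_loss(logits, labels, step_weights(3))
    print(f"uniform: {uniform:.4f}  ramped: {ramped:.4f}")
```

With uniform weights this reduces to the ordinary mean cross-entropy over the sequence. A ramp that puts zero weight on the first step simply ignores predictions made from an almost empty hidden state, which is one plausible way to make training resemble streaming inference.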

Code

jbistanbul/miniroad official

Tasks

Action Detection

Online Action Detection

Datasets

ActivityNet

THUMOS14

TVSeries

FineAction

Results from the Paper

Ranked #1 on Online Action Detection on TVSeries
