TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Action Recognition	Volleyball	SSU (GT)	Accuracy	81.8	# 3
Action Recognition	Volleyball	GTT (VGG19)	Accuracy	82.6	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/social-scene-understanding-end-to-end-multi/action-recognition-in-videos-on-volleyball)](https://paperswithcode.com/sota/action-recognition-in-videos-on-volleyball?p=social-scene-understanding-end-to-end-multi)`

Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition

CVPR 2017 · Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese ·

We present a unified framework for understanding human social behaviors in raw image sequences. Our model jointly detects multiple individuals, infers their social actions, and estimates the collective actions with a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection algorithms but rather is trained end-to-end to generate dense proposal maps that are refined via a novel inference scheme. The temporal consistency is handled via a person-level matching Recurrent Neural Network. The complete model takes as input a sequence of frames and outputs detections along with the estimates of individual actions and collective activities. We demonstrate state-of-the-art performance of our algorithm on multiple publicly available benchmarks.