Detecting activities in extended videos.
This thesis explore different approaches using Convolutional and Recurrent Neural Networks to classify and temporally localize activities on videos, furthermore an implementation to achieve it has been proposed.
In this paper, we introduce the concept of learning latent super-events from activity videos, and present how it benefits activity detection in continuous videos.
#2 best model for Action Detection on Multi-THUMOS
We propose an Efficient Activity Detection System, Argus, for Extended Video Analysis in the surveillance scenario.
We introduce a new convolutional layer named the Temporal Gaussian Mixture (TGM) layer and present how it can be used to efficiently capture longer-term temporal information in continuous activity videos.
SOTA for Action Detection on Multi-THUMOS
Being able to detect and recognize human activities is essential for several applications, including personal assistive robotics.
This paper presents a smartphone app that performs real-time voice activity detection based on convolutional neural network.
Convolutional neural networks (CNNs) are inherently subject to invariable filters that can only aggregate local inputs with the same topological structures.