Activity Detection

63 papers with code • 1 benchmarks • 12 datasets

Detecting activities in extended videos.

Libraries

Use these libraries to find Activity Detection models and implementations

Most implemented papers

A Convolutional Neural Network Smartphone App for Real-Time Voice Activity Detection

SIP-Lab/CNN-VAD IEEE Access 2018

This paper presents a smartphone app that performs real-time voice activity detection based on convolutional neural network.

Temporal Gaussian Mixture Layer for Videos

piergiaj/tgm-icml19 ICLR 2019

We introduce a new convolutional layer named the Temporal Gaussian Mixture (TGM) layer and present how it can be used to efficiently capture longer-term temporal information in continuous activity videos.

S3D: Single Shot multi-Span Detector via Fully 3D Convolutional Networks

dazhang-cv/S3D 21 Jul 2018

In this paper, we present a novel Single Shot multi-Span Detector for temporal activity detection in long, untrimmed videos using a simple end-to-end fully three-dimensional convolutional (Conv3D) network.

Structure-Aware Convolutional Neural Networks

vector-1127/SACNNs NeurIPS 2018

Convolutional neural networks (CNNs) are inherently subject to invariable filters that can only aggregate local inputs with the same topological structures.

The Second DIHARD Diarization Challenge: Dataset, task, and baselines

iiscleap/DIHARD_2019_baseline_alltracks 18 Jun 2019

This paper introduces the second DIHARD challenge, the second in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variation in recording equipment, noise conditions, and conversational domain.

Personalized Activity Recognition with Deep Triplet Embeddings

dmbee/fcn-core 15 Jan 2020

The novel subject triplet loss provides the best performance overall, and all personalized deep embeddings out-perform our baseline personalized engineered feature embedding and an impersonal fully convolutional neural network classifier.

Dual Attention in Time and Frequency Domain for Voice Activity Detection

Jo0o0Hyung/Dual-Attention-for-VAD 27 Mar 2020

The results show that the focal loss can improve the performance in various imbalance situations compared to the cross entropy loss, a commonly used loss function in VAD.

audino: A Modern Annotation Tool for Audio and Speech

midas-research/audino 9 Jun 2020

The tool allows audio data and their corresponding annotations to be uploaded and assigned to a user through a key-based API.

RespVAD: Voice Activity Detection via Video-Extracted Respiration Patterns

arnabkmondal/RespVAD 21 Aug 2020

The Respiration Pattern is first extracted from the video focusing on the abdominal-thoracic region of a speaker using an optical flow based method.