Activity Detection

63 papers with code • 1 benchmarks • 12 datasets

Detecting activities in extended videos.

Libraries

Use these libraries to find Activity Detection models and implementations

Latest papers with no code

A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model

no code yet • 5 Feb 2024

Criminal and suspicious activity detection has become a popular research topic in recent years.

Joint User Detection and Localization in Near-Field Using Reconfigurable Intelligent Surfaces

no code yet • 4 Feb 2024

This letter studies the problem of jointly detecting active user equipments (UEs) and estimating their location in the near field, wherein the base station (BS) is unaware of the number of active (or inactive) UEs and their positions.

Activity Detection for Massive Connectivity in Cell-free Networks with Unknown Large-scale Fading, Channel Statistics, Noise Variance, and Activity Probability: A Bayesian Approach

no code yet • 30 Jan 2024

This problem is even more severe in cell-free networks as there are many of these parameters to be acquired.

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

no code yet • 16 Jan 2024

The proposed method can take audio-visual input and leverage the speaker's acoustic footprint or lip track to flexibly conduct audio-based, video-based, and audio-visual speaker diarization in a unified sequence-to-sequence framework.

Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments

no code yet • 7 Jan 2024

Speech separation involves extracting an individual speaker's voice from a multi-speaker audio signal.

Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions

no code yet • 27 Dec 2023

Our experiments show that self-supervised pretraining not only improves performance in clean conditions, but also yields models which are more robust to adverse conditions compared to purely supervised learning.

Spatiotemporal Event Graphs for Dynamic Scene Understanding

no code yet • 11 Dec 2023

In this thesis, we present a series of frameworks for dynamic scene understanding starting from road event detection from an autonomous driving perspective to complex video activity detection, followed by continual learning approaches for the life-long learning of the models.

Towards More Practical Group Activity Detection: A New Benchmark and Model

no code yet • 5 Dec 2023

Group activity detection (GAD) is the task of identifying members of each group and classifying the activity of the group at the same time in a video.

SPIRE-SIES: A Spontaneous Indian English Speech Corpus

no code yet • 1 Dec 2023

Transcripts for 23 hours is generated and validated which can serve as a spontaneous speech ASR benchmark.

Combatting Human Trafficking in the Cyberspace: A Natural Language Processing-Based Methodology to Analyze the Language in Online Advertisements

no code yet • 22 Nov 2023

This project tackles the pressing issue of human trafficking in online C2C marketplaces through advanced Natural Language Processing (NLP) techniques.