Action Detection

233 papers with code • 11 benchmarks • 33 datasets

Action Detection aims to find both where and when an action occurs within a video clip and classify what the action is taking place. Typically results are given in the form of action tublets, which are action bounding boxes linked across time in the video. This is related to temporal localization, which seeks to identify the start and end frame of an action, and action recognition, which seeks only to classify which action is taking place and typically assumes a trimmed video.

Libraries

Use these libraries to find Action Detection models and implementations
6 papers
3,908
2 papers
3,001
See all 6 libraries.

Long-term Conversation Analysis: Exploring Utility and Privacy

ol-mega/ppca 28 Jun 2023

The analysis of conversations recorded in everyday life requires privacy protection.

2
28 Jun 2023

E2E-LOAD: End-to-End Long-form Online Action Detection

sqiangcao99/e2e-load ICCV 2023

Furthermore, we propose a novel and efficient inference mechanism that accelerates heavy spatial-temporal exploration.

10
13 Jun 2023

ShuttleSet: A Human-Annotated Stroke-Level Singles Dataset for Badminton Tactical Analysis

wywywang/coachai-projects 8 Jun 2023

With the recent progress in sports analytics, deep learning approaches have demonstrated the effectiveness of mining insights into players' tactics for improving performance quality and fan engagement.

71
08 Jun 2023

FunASR: A Fundamental End-to-End Speech Recognition Toolkit

alibaba-damo-academy/FunASR 18 May 2023

FunASR offers models trained on large-scale industrial corpora and the ability to deploy them in applications.

3,378
18 May 2023

Efficient Video Action Detection with Token Dropout and Context Refinement

MCG-NJU/VideoMAE ICCV 2023

Our EVAD consists of two specialized designs for video action detection.

1,214
17 Apr 2023

WEAR: An Outdoor Sports Dataset for Wearable and Egocentric Activity Recognition

mariusbock/wear 11 Apr 2023

Though research has shown the complementarity of camera- and inertial-based data, datasets which offer both egocentric video and inertial-based sensor data remain scarce.

7
11 Apr 2023

Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection

webber2933/iclip 10 Apr 2023

Finally, we calculate the similarity between the interaction feature and the text feature for each label to determine the action category.

13
10 Apr 2023

Boundary-Denoising for Video Activity Localization

frostinassiky/denoiseloc 6 Apr 2023

To alleviate the boundary ambiguity, we propose to study the video activity localization problem from a denoising perspective.

4
06 Apr 2023

Evaluation of Noise Reduction Methods for Sentence Recognition by Sinhala Speaking Listeners

chathukiket/evaluation-of-noise-reduction-methods 31 Mar 2023

Noise reduction is a crucial aspect of hearing aids, which researchers have been striving to address over the years.

0
31 Mar 2023

DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion

sauradip/diffusiontad ICCV 2023

Concretely, we establish the denoising process in the Transformer decoder (e. g., DETR) by introducing a temporal location query design with faster convergence in training.

28
27 Mar 2023