3D Action Recognition

34 papers with code • 3 benchmarks • 14 datasets

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Action Recognition

Dataset	Best Model	Compare
Assembly101	HandFormer-B/21	See all
NTU RGB+D	Kinet	See all
100 sleep nights of 8 caregivers	htf	See all

Libraries

Use these libraries to find 3D Action Recognition models and implementations

open-mmlab/mmaction2

2 papers

3,916

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Real-time 3D human action recognition based on Hyperpoint sequence

xingli1012/sequentialpointnet • • 16 Nov 2021

Instead of capturing spatio-temporal local structures, SequentialPointNet encodes the temporal evolution of static appearances to recognize human actions.

Paper
Code

3DVNet: Multi-View Depth Prediction and Volumetric Refinement

alexrich021/3dvnet • • 1 Dec 2021

Furthermore, unlike existing volumetric MVS techniques, our 3D CNN operates on a feature-augmented point cloud, allowing for effective aggregation of multi-view information and flexible iterative refinement of depth maps.

Paper
Code

Deep Hierarchical Representation of Point Cloud Videos via Spatio-Temporal Decomposition

hehefan/PSTNet2 • • IEEE Transactions on Pattern Analysis and Machine Intelligence 2021

Specifically, a spatial operation is employed to capture the local structure of each spatial region in a tube and a temporal operation is used to model the dynamics of the spatial regions along the tube.

Paper
Code

Domain Knowledge-Informed Self-Supervised Representations for Workout Form Assessment

ParitoshParmar/Fitness-AQA • 28 Feb 2022

To that end, we propose to learn exercise-oriented image and video representations from unlabeled samples such that a small dataset annotated by experts suffices for supervised error detection.

Paper
Code

No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces

jx-zhong-for-academic-purpose/kinet • • CVPR 2022

Scene flow is a powerful tool for capturing the motion field of 3D point clouds.

Paper
Code

Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities

assembly101/assembly101.github.io • CVPR 2022

Assembly101 is a new procedural activity dataset featuring 4321 videos of people assembling and disassembling 101 "take-apart" toy vehicles.

Paper
Code

PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

hehefan/Point-Spatio-Temporal-Convolution • • ICLR 2021

Then, a spatial convolution is employed to capture the local structure of points in the 3D space, and a temporal convolution is used to model the dynamics of the spatial regions along the time dimension.

Paper
Code

Multi-Scale Spatial Temporal Graph Convolutional Network for Skeleton-Based Action Recognition

czhaneva/mst-gcn • • 27 Jun 2022

To solve this problem, we present a multi-scale spatial graph convolution (MS-GC) module and a multi-scale temporal graph convolution (MT-GC) module to enrich the receptive field of the model in spatial and temporal dimensions.

Paper
Code

Collaborating Domain-shared and Target-specific Feature Clustering for Cross-domain 3D Action Recognition

canbaoburen/CoDT • • 20 Jul 2022

Furthermore, to leverage the complementarity of domain-shared features and target-specific features, we propose a novel collaborative clustering strategy to enforce pair-wise relationship consistency between the two branches.

Paper
Code

CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

maoyunyao/cmd • • 26 Aug 2022

In this work, we formulate the cross-modal interaction as a bidirectional knowledge distillation problem.

Paper
Code

3D Action Recognition

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result