Auto-Encoding Score Distribution Regression for Action Quality Assessment

22 Nov 2021  ยท  Boyu Zhang, Jiayuan Chen, Yinfei Xu, HUI ZHANG, Xu Yang, Xin Geng ยท

The action quality assessment (AQA) of videos is a challenging vision task since the relation between videos and action scores is difficult to model. Thus, AQA has been widely studied in the literature. Traditionally, AQA is treated as a regression problem to learn the underlying mappings between videos and action scores. But previous methods ignored data uncertainty in AQA dataset. To address aleatoric uncertainty, we further develop a plug-and-play module Distribution Auto-Encoder (DAE). Specifically, it encodes videos into distributions and uses the reparameterization trick in variational auto-encoders (VAE) to sample scores, which establishes a more accurate mapping between videos and scores. Meanwhile, a likelihood loss is used to learn the uncertainty parameters. We plug our DAE approach into MUSDL and CoRe. Experimental results on public datasets demonstrate that our method achieves state-of-the-art on AQA-7, MTL-AQA, and JIGSAWS datasets. Our code is available at https://github.com/InfoX-SEU/DAE-AQA.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Action Quality Assessment AQA-7 DAE-MLP Spearman Correlation 82.58% # 3
Action Quality Assessment AQA-7 DAE-CoRe Spearman Correlation 85.20% # 1
Action Quality Assessment JIGSAWS DAE-MLP Spearman Correlation 0.72 # 3
Action Quality Assessment JIGSAWS DAE-CoRe Spearman Correlation 0.86 # 1
Action Quality Assessment JIGSAWS DAE-MT Spearman Correlation 0.76 # 2
Action Quality Assessment MTL-AQA DAE-MT Spearman Correlation 94.52 # 3
Action Quality Assessment MTL-AQA DAE-CoRe Spearman Correlation 95.89 # 1
Action Quality Assessment MTL-AQA DAE-MLP Spearman Correlation 92.31 # 8

Methods


No methods listed for this paper. Add relevant methods here