no code implementations • 18 Apr 2024 • Jie Ma, Min Hu, Pinghui Wang, Wangchun Sun, Lingyun Song, Hongbin Pei, Jun Liu, Youtian Du
The former leads to a large, diverse test space, while the latter results in a comprehensive robustness evaluation on rare, frequent, and overall questions.
Audio-visual Question Answering Audio-Visual Question Answering (AVQA) +3
1 code implementation • 18 Mar 2024 • Hang Wang, Zhi-Qi Cheng, Youtian Du, Lei Zhang
Our research addresses the shortfall by introducing a novel approach to VAC, called Irregular Video Action Counting (IVAC).