1 code implementation • 4 Apr 2024 • Tiantian Geng, Teng Wang, yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng
Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL).
1 code implementation • CVPR 2023 • Tiantian Geng, Teng Wang, Jinming Duan, Runmin Cong, Feng Zheng
To better adapt to real-life applications, in this paper we focus on the task of dense-localizing audio-visual events, which aims to jointly localize and recognize all audio-visual events occurring in an untrimmed video.
Ranked #1 on audio-visual event localization on UnAV-100
no code implementations • TIP 2022 • Tiantian Geng, Feng Zheng, Xiaorong Hou, Ke Lu, Guo-Jun Qi, Ling Shao
Spatial-temporal relation reasoning is a significant yet challenging problem for video action recognition.
Ranked #35 on Action Recognition on Something-Something V1
1 code implementation • 8 Mar 2022 • Jingfei Xia, Mingchen Zhuge, Tiantian Geng, Shun Fan, Yuantai Wei, Zhenyu He, Feng Zheng
Figure skating scoring is challenging because it requires judging the technical moves of the players as well as their coordination with the background music.