Search Results for author: Tiantian Geng

Found 4 papers, 3 papers with code

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

1 code implementation4 Apr 2024 Tiantian Geng, Teng Wang, yanfu Zhang, Jinming Duan, Weili Guan, Feng Zheng

Video localization tasks aim to temporally locate specific instances in videos, including temporal action localization (TAL), sound event detection (SED) and audio-visual event localization (AVEL).

audio-visual event localization Event Detection +2

Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline

1 code implementation CVPR 2023 Tiantian Geng, Teng Wang, Jinming Duan, Runmin Cong, Feng Zheng

To better adapt to real-life applications, in this paper we focus on the task of dense-localizing audio-visual events, which aims to jointly localize and recognize all audio-visual events occurring in an untrimmed video.

audio-visual event localization

Skating-Mixer: Long-Term Sport Audio-Visual Modeling with MLPs

1 code implementation8 Mar 2022 Jingfei Xia, Mingchen Zhuge, Tiantian Geng, Shun Fan, Yuantai Wei, Zhenyu He, Feng Zheng

Figure skating scoring is challenging because it requires judging the technical moves of the players as well as their coordination with the background music.

Representation Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.