no code implementations • 22 Apr 2024 • Qiwen Deng, Yangcen Liu, Wen Li, Guoqing Wang
Particularly, an SRM filter is utilized to extract high-frequency details, which are combined with spatial features as input to the BSD.
no code implementations • 20 Apr 2024 • Yangcen Liu, Ziyi Liu, Yuanhao Zhai, Wen Li, David Doerman, Junsong Yuan
To address this problem, we propose the Generalizable Temporal Action Localization task (GTAL), which focuses on improving the generalizability of action localization methods.
Weakly-supervised Temporal Action Localization Weakly Supervised Temporal Action Localization
no code implementations • 18 Apr 2024 • Xunsong Li, Pengzhan Sun, Yangcen Liu, Lixin Duan, Wen Li
Existing methods usually adopt a two-stage pipeline, where object proposals are first detected using a pretrained detector, and then are fed to an action recognition model for extracting video features and learning the object relations for action recognition.