Search Results for author: Fangtao Shao

Found 3 papers, 1 papers with code

Fine-grained Text-Video Retrieval with Frozen Image Encoders

no code implementations • 14 Jul 2023 • Zuozhuo Dai, Fangtao Shao, Qingkun Su, Zilong Dong, Siyu Zhu

In the second stage, we propose a novel decoupled video text cross attention module to capture fine-grained multimodal information in spatial and temporal dimensions.

Decoder Retrieval +1

Paper
Add Code

Towards Robust Video Instance Segmentation with Temporal-Aware Transformer

no code implementations • 20 Jan 2023 • Zhenghao Zhang, Fangtao Shao, Zuozhuo Dai, Siyu Zhu

In this paper, we observe the temporal information is important as well and we propose TAFormer to aggregate spatio-temporal features both in transformer encoder and decoder.

Decoder Instance Segmentation +2

Paper
Add Code

Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision

1 code implementation • ECCV 2020 • Peng Wu, Jing Liu, Yujia Shi, Yujia Sun, Fangtao Shao, Zhaoyang Wu, Zhiwei Yang

Violence detection has been studied in computer vision for years.

Ranked #9 on Anomaly Detection In Surveillance Videos on XD-Violence

Anomaly Detection In Surveillance Videos

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.