no code implementations • 7 Dec 2022 • Yue Ma, Tianyu Yang, Yin Shan, Xiu Li
This paper presents SimVTP: a Simple Video-Text Pretraining framework via masked autoencoders.
Ranked #16 on Moment Retrieval on Charades-STA
Contrastive Learning Moment Retrieval +1