no code implementations • 18 Apr 2024 • Han Fang, Xianghao Zang, Chao Ban, Zerun Feng, Lanxiang Zhou, Zhongjiang He, Yongxiang Li, Hao Sun
Text-video retrieval aims to find the most relevant cross-modal samples for a given query.
Retrieval Video Retrieval