no code implementations • 3 Mar 2024 • Yongchao Du, Min Wang, Wengang Zhou, Shuping Hui, Houqiang Li
To tackle the above problems, we propose Image2Sentence based Asymmetric zero-shot composed image retrieval (ISA), which takes advantage of the VL model and only relies on unlabeled images for composition learning.