no code implementations • 17 Apr 2024 • Wenbo Zhang, Yifan Zhang, Jianfeng Lin, Binqiang Huang, Jinlu Zhang, Wenhao Yu
Pre-trained vision-language (V-L) models such as CLIP have shown excellent performance in many downstream cross-modal tasks.
1 code implementation • 21 Nov 2023 • Xiao Liu, Jianfeng Lin, Jiawei Zhang
The proliferation of Large Language Models like ChatGPT has significantly advanced language understanding and generation, impacting a broad spectrum of applications.