2 code implementations • 5 Apr 2024 • Wenyi Mo, Tianyu Zhang, Yalong Bai, Bing Su, Ji-Rong Wen, Qing Yang
Users assign weights or alter the injection time steps of certain words in the text prompts to improve the quality of generated images.
2 code implementations • 16 Sep 2022 • Jiangmeng Li, Wenwen Qiang, Yanan Zhang, Wenyi Mo, Changwen Zheng, Bing Su, Hui Xiong
As a successful approach to self-supervised learning, contrastive learning aims to learn invariant information shared among distortions of the input sample.
no code implementations • 23 May 2022 • Jiangmeng Li, Wenyi Mo, Wenwen Qiang, Bing Su, Changwen Zheng
Vision-language models are pre-trained by aligning image-text pairs in a common space so that the models can deal with open-set visual concepts by learning semantic information from textual labels.
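As a rough illustration of the alignment idea in the last entry, the sketch below classifies an image by comparing its embedding with text-label embeddings in a shared space. It is a minimal toy under stated assumptions, not the method of any paper listed here: the three-dimensional vectors, the label names, the `zero_shot_classify` helper, and the temperature value are all hypothetical stand-ins for what a real pre-trained encoder would produce.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_classify(image_embedding, label_embeddings, temperature=0.07):
    """Score an image against each text label in the shared space.

    `temperature` is an assumed scaling constant chosen for illustration.
    """
    labels = list(label_embeddings)
    logits = [
        cosine(image_embedding, label_embeddings[label]) / temperature
        for label in labels
    ]
    return dict(zip(labels, softmax(logits)))


# Hypothetical toy embeddings; a real model would produce these from
# an image encoder and a text encoder trained on image-text pairs.
label_embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "dog": [0.1, 0.9, 0.0],
    "car": [0.0, 0.1, 0.9],
}
image_embedding = [0.8, 0.2, 0.1]

probs = zero_shot_classify(image_embedding, label_embeddings)
best = max(probs, key=probs.get)
```

Because the labels enter only as text embeddings, adding a new class amounts to adding a new entry to `label_embeddings`, which is what lets such models handle open-set concepts without retraining a classifier head.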