1 code implementation • 5 Feb 2024 • Yoad Tewel, Omri Kaduri, Rinon Gal, Yoni Kasten, Lior Wolf, Gal Chechik, Yuval Atzmon
Text-to-image models offer a new level of creative flexibility by allowing users to guide the image generation process through natural language.
no code implementations • 2 May 2023 • Yoad Tewel, Rinon Gal, Gal Chechik, Yuval Atzmon
The task of T2I personalization poses multiple hard challenges, such as maintaining high visual fidelity while allowing creative control, combining multiple personalized concepts in a single image, and keeping a small model size.
1 code implementation • 22 Jul 2022 • Yoad Tewel, Yoav Shalev, Roy Nadler, Idan Schwartz, Lior Wolf
We introduce a zero-shot video captioning method that employs two frozen networks: the GPT-2 language model and the CLIP image-text matching model.
1 code implementation • 19 Jun 2022 • Tal Shaharabany, Yoad Tewel, Lior Wolf
Moreover, training takes place in a weakly supervised setting, where no bounding boxes are provided.
1 code implementation • CVPR 2022 • Yoad Tewel, Yoav Shalev, Idan Schwartz, Lior Wolf
While such models can provide a powerful score for matching and subsequent zero-shot tasks, they are not capable of generating a caption given an image.