1 code implementation • 5 Dec 2023 • Soon Yau Cheong, Armin Mustafa, Andrew Gilbert
This paper introduces ViscoNet, a novel method that enhances text-to-image human generation models with visual prompting.
1 code implementation • 18 Apr 2023 • Soon Yau Cheong, Armin Mustafa, Andrew Gilbert
Text-to-image (T2I) models such as Stable Diffusion have been used to generate high-quality images of people.
Ranked #1 on Pose Transfer on Deep-Fashion (FID metric)
1 code implementation • 9 Mar 2022 • Soon Yau Cheong, Armin Mustafa, Andrew Gilbert
Therefore we propose a new method, Keypoint Pose Encoding (KPE). KPE is 10 times more memory efficient and over 73% faster at generating high-quality images from text input conditioned on the pose.