3 Oct 2023 • Kota Sueyoshi, Takashi Matsubara
We consider that the root of the above issues lies in the text encoder, which often focuses only on individual words and neglects the logical relationships between them.
Image Generation