1 code implementation • 29 Apr 2024 • Guillaume Astruc, Nicolas Dufour, Ioannis Siglidis, Constantin Aronssohn, Nacim Bouia, Stephanie Fu, Romain Loiseau, Van Nguyen Nguyen, Charles Raude, Elliot Vincent, Lintao XU, HongYu Zhou, Loic Landrieu
Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms.
no code implementations • 19 Apr 2024 • Xi Wang, Nicolas Dufour, Nefeli Andreou, Marie-Paule Cani, Victoria Fernandez Abrevaya, David Picard, Vicky Kalogeiton
Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models.
no code implementations • 21 Mar 2023 • Robin Courant, Maika Edberg, Nicolas Dufour, Vicky Kalogeiton
For image classification, the most common Transformer Architecture uses only the Transformer Encoder in order to transform the various input tokens.
1 code implementation • 10 Oct 2022 • Nicolas Dufour, David Picard, Vicky Kalogeiton
In this work, we introduce SCAM (Semantic Cross Attention Modulation), a system that encodes rich and diverse information in each semantic region of the image (including foreground and background), thus achieving precise generation with emphasis on fine details.
Ranked #1 on Pose Transfer on CelebAMask-HQ