Vision Transformers

CrossTransformers

Introduced by Doersch et al. in CrossTransformers: spatially-aware few-shot transfer

CrossTransformers is a Transformer-based neural network architecture which can take a small number of labeled images and an unlabeled query, find coarse spatial correspondence between the query and the labeled images, and then infer class membership by computing distances between spatially-corresponding features.

Source: CrossTransformers: spatially-aware few-shot transfer

Papers


Paper Code Results Date Stars

Tasks


Components


Component Type
🤖 No Components Found You can add them if they exist; e.g. Mask R-CNN uses RoIAlign

Categories