Gated Positional Self-Attention (GPSA) is a self-attention module for vision transformers, introduced in the ConViT architecture, whose positional attention can be initialized to act as a convolutional layer, giving the ViT a soft inductive bias toward locality. Each head mixes content-based attention with a purely positional attention map through a learnable gating parameter, so the network can decide during training how convolutional to remain.
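The mechanism is compact enough to sketch: each head computes an ordinary content attention map, softmax(QKᵀ/√d), and a positional map derived from relative patch offsets, then blends the two with a per-head sigmoid gate; initializing the positional projection so each head peaks at one fixed offset of a small grid reproduces a convolution at initialization. Below is a minimal PyTorch sketch of this idea, simplified from the paper's formulation. Names such as `pos_proj`, `gating_param`, and `local_init` follow the spirit of the official ConViT code but are not its exact API; dropout, the class token, and other details are omitted, and a square patch grid with a square number of heads is assumed.

```python
import torch
import torch.nn as nn


class GPSA(nn.Module):
    """Minimal sketch of Gated Positional Self-Attention (ConViT-style)."""

    def __init__(self, dim, num_heads=9, locality_strength=1.0):
        super().__init__()
        assert dim % num_heads == 0
        self.dim, self.num_heads = dim, num_heads
        self.scale = (dim // num_heads) ** -0.5
        self.locality_strength = locality_strength

        self.qk = nn.Linear(dim, dim * 2, bias=False)  # content queries/keys
        self.v = nn.Linear(dim, dim, bias=False)       # values
        self.proj = nn.Linear(dim, dim)
        # maps relative-position features (dx, dy, dx^2 + dy^2) to per-head scores
        self.pos_proj = nn.Linear(3, num_heads)
        # per-head gate: sigmoid(gate) = 1 means purely positional (convolutional)
        self.gating_param = nn.Parameter(torch.ones(num_heads))

    def rel_indices(self, num_patches):
        # (1, N, N, 3) tensor of relative offsets between all patch pairs;
        # assumes num_patches is a perfect square (a square patch grid)
        size = int(num_patches ** 0.5)
        ind = torch.arange(size).view(1, -1) - torch.arange(size).view(-1, 1)
        indx = ind.repeat(size, size)
        indy = ind.repeat_interleave(size, dim=0).repeat_interleave(size, dim=1)
        return torch.stack([indx, indy, indx ** 2 + indy ** 2], dim=-1).float()[None]

    def local_init(self):
        # initialize each head's positional attention to peak at one offset of a
        # sqrt(num_heads) x sqrt(num_heads) grid, like a conv kernel's taps
        self.v.weight.data.copy_(torch.eye(self.dim))
        k = int(self.num_heads ** 0.5)
        center = k // 2  # assumes odd k, e.g. num_heads = 9 -> a 3x3 "kernel"
        for h1 in range(k):
            for h2 in range(k):
                h = h1 + k * h2
                self.pos_proj.weight.data[h] = torch.tensor(
                    [2.0 * (h2 - center), 2.0 * (h1 - center), -1.0]
                )
        self.pos_proj.weight.data *= self.locality_strength

    def forward(self, x):
        B, N, C = x.shape
        qk = self.qk(x).reshape(B, N, 2, self.num_heads, C // self.num_heads)
        q, k = qk.permute(2, 0, 3, 1, 4)  # each: (B, heads, N, head_dim)

        content = ((q @ k.transpose(-2, -1)) * self.scale).softmax(dim=-1)
        pos = self.pos_proj(self.rel_indices(N).to(x.device))  # (1, N, N, heads)
        pos = pos.permute(0, 3, 1, 2).softmax(dim=-1)          # (1, heads, N, N)

        # convex combination of content and positional attention, per head
        gate = torch.sigmoid(self.gating_param).view(1, -1, 1, 1)
        attn = (1.0 - gate) * content + gate * pos
        attn = attn / attn.sum(dim=-1, keepdim=True)  # renormalize rows

        v = self.v(x).reshape(B, N, self.num_heads, C // self.num_heads)
        out = (attn @ v.transpose(1, 2)).transpose(1, 2).reshape(B, N, C)
        return self.proj(out)


# Usage with illustrative numbers: a 14x14 patch grid, embedding dim 192
gpsa = GPSA(dim=192, num_heads=9)
gpsa.local_init()                 # start out as a 3x3 "convolution"
x = torch.randn(2, 196, 192)
print(gpsa(x).shape)              # torch.Size([2, 196, 192])
```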
Source: ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
| Task | Papers | Share |
|---|---|---|
| Image Classification | 2 | 50.00% |
| Language Modelling | 1 | 25.00% |
| Fine-Grained Image Classification | 1 | 25.00% |
| Component | Type |
|---|---|
| Dropout | Regularization |
| Layer Normalization | Normalization |
| Scaled Dot-Product Attention | Attention Mechanisms |
| Softmax | Output Functions |