Lightweight Convolution

Introduced by Wu et al. in Pay Less Attention with Lightweight and Dynamic Convolutions

LightConv is a depthwise convolution for sequence modelling that shares its weights across groups of output channels and normalizes each kernel across the temporal dimension with a softmax. Unlike self-attention, LightConv has a fixed context window, and the weights it assigns to context elements do not change across time steps. For the $i$-th element in the sequence and output channel $c$, LightConv computes:

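$$ \text{LightConv}\left(X, W_{\left\lceil \frac{cH}{d} \right\rceil,:}, i, c\right) = \text{DepthwiseConv}\left(X, \text{softmax}\left(W_{\left\lceil \frac{cH}{d} \right\rceil,:}\right), i, c\right) $$

Here $X \in \mathbb{R}^{n \times d}$ is the input sequence of $n$ elements with $d$ channels, $H$ is the number of heads (groups of channels that share one kernel), and $W \in \mathbb{R}^{H \times k}$ holds one kernel of width $k$ per head. The softmax is taken over the $k$ kernel positions.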
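
The formula above can be sketched in NumPy as follows. This is a minimal illustration, not the authors' implementation. It assumes an input `X` of shape `(n, d)` (sequence length by channels), a weight matrix `W` of shape `(H, k)` (heads by kernel width), an odd `k` with a centered window, zero padding at the sequence boundaries, and `d` divisible by `H`. The function name `light_conv` and these shape conventions are chosen for this example only.

```python
import numpy as np

def softmax(w, axis=-1):
    # Numerically stable softmax along the given axis.
    w = w - w.max(axis=axis, keepdims=True)
    e = np.exp(w)
    return e / e.sum(axis=axis, keepdims=True)

def light_conv(X, W):
    """Sketch of LightConv. X: (n, d) input, W: (H, k) raw kernels. Returns (n, d)."""
    n, d = X.shape
    H, k = W.shape
    assert d % H == 0, "assumption: channels split evenly across heads"
    assert k % 2 == 1, "assumption: odd kernel width, centered window"

    # Normalize each head's kernel over its k temporal positions.
    Wn = softmax(W, axis=-1)

    # With 0-based channel index c, c // (d // H) picks the same head
    # as ceil(cH/d) does for 1-based c in the formula.
    head = np.arange(d) // (d // H)
    kernels = Wn[head]                      # (d, k): one kernel per channel

    # Zero-pad the time axis so every position sees a full window.
    pad = k // 2
    Xp = np.pad(X, ((pad, pad), (0, 0)))

    # Depthwise convolution: each channel is filtered independently.
    out = np.zeros((n, d), dtype=float)
    for j in range(k):
        out += Xp[j:j + n] * kernels[:, j]
    return out

# Example call: 6 time steps, 8 channels, 2 heads, kernel width 3.
rng = np.random.default_rng(0)
Y = light_conv(rng.normal(size=(6, 8)), rng.normal(size=(2, 3)))
```

Because only `H * k` weights are stored rather than `d * k`, the parameter count does not grow with the number of channels per head. A causal variant, which looks only at past positions, would pad on the left side only; that change is not shown here.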

Source: Pay Less Attention with Lightweight and Dynamic Convolutions

Tasks

Task	Papers	Share
Abstractive Text Summarization	1	25.00%
Language Modelling	1	25.00%
Machine Translation	1	25.00%
Translation	1	25.00%

Components

Component	Type
Depthwise Convolution	Convolutions
Softmax	Output Functions

Categories

Convolutions

Temporal Convolutions