MoCo, or Momentum Contrast, is a self-supervised learning algorithm with a contrastive loss.
Contrastive loss methods can be thought of as building dynamic dictionaries. The "keys" (tokens) in the dictionary are sampled from data (e.g., images or patches) and are represented by an encoder network. Unsupervised learning trains encoders to perform dictionary look-up: an encoded "query" should be similar to its matching key and dissimilar to others. Learning is formulated as minimizing a contrastive loss.
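This dictionary look-up can be made concrete with an InfoNCE-style contrastive loss, which treats the look-up as an (n+1)-way classification in which the matching key is the correct "class". Below is a minimal NumPy sketch for a single query; the function name, the temperature `tau`, and the toy data are illustrative assumptions, not the paper's exact setup.

```python
import numpy as np

def info_nce_loss(q, k_pos, k_negs, tau=0.07):
    """Contrastive (InfoNCE-style) loss for one encoded query.

    q:      (d,) encoded query
    k_pos:  (d,) the matching (positive) key
    k_negs: (n, d) non-matching (negative) keys from the dictionary
    """
    # similarities of the query to the positive and negative keys,
    # scaled by a temperature; the positive sits at index 0
    logits = np.concatenate([[q @ k_pos], k_negs @ q]) / tau
    logits -= logits.max()  # numerical stability for the softmax
    log_probs = logits - np.log(np.exp(logits).sum())
    return -log_probs[0]    # negative log-likelihood of the positive

# toy example: the query is close to its positive key, far from negatives
rng = np.random.default_rng(0)
d = 8
q = rng.normal(size=d); q /= np.linalg.norm(q)
k_pos = q + 0.1 * rng.normal(size=d); k_pos /= np.linalg.norm(k_pos)
k_negs = rng.normal(size=(16, d))
k_negs /= np.linalg.norm(k_negs, axis=1, keepdims=True)
loss = info_nce_loss(q, k_pos, k_negs)
```

Minimizing this loss pulls the query toward its matching key and pushes it away from all other keys in the dictionary.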
MoCo can be viewed as a way to build large and consistent dictionaries for unsupervised learning with a contrastive loss. In MoCo, we maintain the dictionary as a queue of data samples: the encoded representations of the current mini-batch are enqueued, and the oldest are dequeued. The queue decouples the dictionary size from the mini-batch size, allowing it to be large. Moreover, as the dictionary keys come from the preceding several mini-batches, a slowly progressing key encoder, implemented as a momentum-based moving average of the query encoder, is proposed to maintain consistency.

Source: Momentum Contrast for Unsupervised Visual Representation Learning

Papers:
- Demystifying Contrastive Self-Supervised Learning: Invariances, Augmentations and Dataset Biases
- Parametric Instance Classification for Unsupervised Visual Feature Learning
- What makes instance discrimination good for transfer learning?
- Improved Baselines with Momentum Contrastive Learning
- Learning Speaker Embedding with Momentum Contrast
- Momentum Contrast for Unsupervised Visual Representation Learning
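The queue and momentum update described above can be sketched as follows. This is a toy NumPy illustration, not the paper's implementation: the "encoders" are single weight matrices standing in for ConvNets, the gradient step is faked with noise, and the queue size `K` is tiny.

```python
import numpy as np
from collections import deque

m = 0.999   # momentum coefficient for the key encoder
K = 4       # dictionary (queue) size, tiny for illustration
rng = np.random.default_rng(0)

# stand-in "encoders": one weight matrix each (real MoCo uses ConvNets)
theta_q = rng.normal(size=(3, 5))   # query encoder parameters
theta_k = theta_q.copy()            # key encoder starts as a copy

queue = deque(maxlen=K)             # FIFO dictionary of encoded keys

for step in range(6):
    batch = rng.normal(size=(2, 5))     # mini-batch of "images"
    keys = batch @ theta_k.T            # encode keys with the key encoder
    for k in keys:
        queue.append(k)                 # enqueue newest; maxlen drops the oldest

    # (the contrastive-loss gradient step on theta_q would happen here;
    #  faked with a small random update for this sketch)
    theta_q += 0.01 * rng.normal(size=theta_q.shape)

    # momentum update: the key encoder slowly tracks the query encoder
    theta_k = m * theta_k + (1 - m) * theta_q
```

Because `m` is close to 1, `theta_k` evolves much more smoothly than `theta_q`, so keys from different mini-batches in the queue remain encoded by nearly the same network, which is what keeps the large dictionary consistent.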
| Task | Papers | Share |
| --- | --- | --- |
| Self-Supervised Image Classification | 2 | 15.38% |
| Unsupervised Representation Learning | 1 | 7.69% |