Relational Embedding for Few-Shot Classification

ICCV 2021  ·  Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho ·

We propose to address the problem of few-shot classification by meta-learning "what to observe" and "where to attend" in a relational perspective. Our method leverages relational patterns within and between images via self-correlational representation (SCR) and cross-correlational attention (CCA). Within each image, the SCR module transforms a base feature map into a self-correlation tensor and learns to extract structural patterns from the tensor. Between the images, the CCA module computes cross-correlation between two image representations and learns to produce co-attention between them. Our Relational Embedding Network (RENet) combines the two relational modules to learn relational embedding in an end-to-end manner. In experimental evaluation, it achieves consistent improvements over state-of-the-art methods on four widely used few-shot classification benchmarks of miniImageNet, tieredImageNet, CUB-200-2011, and CIFAR-FS.

PDF Abstract ICCV 2021 PDF ICCV 2021 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Few-Shot Image Classification CIFAR-FS 5-way (1-shot) RENet Accuracy 74.51 # 27
Few-Shot Image Classification CIFAR-FS 5-way (5-shot) RENet Accuracy 86.60 # 27
Few-Shot Image Classification CUB 200 5-way 1-shot RENet Accuracy 79.49 # 18
Few-Shot Image Classification CUB 200 5-way 5-shot RENet Accuracy 91.11 # 15
Few-Shot Image Classification Mini-Imagenet 5-way (1-shot) RENet Accuracy 67.60 # 40
Few-Shot Image Classification Mini-Imagenet 5-way (5-shot) RENet Accuracy 82.58 # 34
Few-Shot Image Classification Tiered ImageNet 5-way (1-shot) RENet Accuracy 71.61 # 27
Few-Shot Image Classification Tiered ImageNet 5-way (5-shot) RENet Accuracy 85.28 # 31

Methods


No methods listed for this paper. Add relevant methods here