Transductive Information Maximization For Few-Shot Learning

We introduce Transductive Infomation Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. Furthermore, we propose a new alternating-direction solver for our mutual-information loss, which substantially speeds up transductive-inference convergence over gradient-based optimization, while yielding similar accuracy. TIM inference is modular: it can be used on top of any base-training feature extractor. Following standard transductive few-shot settings, our comprehensive experiments demonstrate that TIM outperforms state-of-the-art methods significantly across various datasets and networks, while used on top of a fixed feature extractor trained with simple cross-entropy on the base classes, without resorting to complex meta-learning schemes. It consistently brings between 2% and 5% improvement in accuracy over the best performing method, not only on all the well-established few-shot benchmarks but also on more challenging scenarios,with domain shifts and larger numbers of classes.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Few-Shot Image Classification CUB 200 5-way 1-shot TIM-GD Accuracy 82.2% # 15
Few-Shot Image Classification CUB 200 5-way 5-shot TIM-GD Accuracy 90.8 # 17
Few-Shot Image Classification Mini-Imagenet 10-way (1-shot) TIM-GD Accuracy 56.1 # 3
Few-Shot Image Classification Mini-Imagenet 10-way (5-shot) TIM-GD Accuracy 72.8 # 3
Few-Shot Image Classification Mini-Imagenet 20-way (1-shot) TIM-GD Accuracy 39.3 # 1
Few-Shot Image Classification Mini-Imagenet 20-way (5-shot) TIM-GD Accuracy 59.5 # 1
Few-Shot Learning Mini-ImageNet - 5-Shot Learning TIM-GD Accuracy 87.4% # 2
Few-Shot Image Classification Mini-Imagenet 5-way (1-shot) TIM-GD Accuracy 77.80 # 15
Few-Shot Image Classification Mini-ImageNet to CUB - 5 shot learning TIM-GD Accuracy 71 # 1
Few-Shot Image Classification Tiered ImageNet 5-way (1-shot) TIM-GD Accuracy 82.1 # 9
Few-Shot Image Classification Tiered ImageNet 5-way (5-shot) TIM-GD Accuracy 89.8 # 8

Methods


No methods listed for this paper. Add relevant methods here