ImageNet-21K Pretraining for the Masses

22 Apr 2021 · Tal Ridnik, Emanuel Ben-Baruch, Asaf Noy, Lihi Zelnik-Manor

ImageNet-1K serves as the primary dataset for pretraining deep learning models for computer vision tasks. The ImageNet-21K dataset, which is bigger and more diverse, is used less frequently for pretraining, mainly because of its complexity, limited accessibility, and an underestimation of its added value. This paper aims to close this gap and make high-quality, efficient pretraining on ImageNet-21K available to everyone. Via a dedicated preprocessing stage, utilization of the WordNet hierarchical structure, and a novel training scheme called semantic softmax, we show that various models, including small mobile-oriented ones, significantly benefit from ImageNet-21K pretraining on numerous datasets and tasks. We also show that we outperform previous ImageNet-21K pretraining schemes for prominent new models like ViT and Mixer. Our proposed pretraining pipeline is efficient and accessible, and leads to SoTA reproducible results from a publicly available dataset. The training code and pretrained models are available at: https://github.com/Alibaba-MIIL/ImageNet21K
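The abstract does not spell out the mechanics of semantic softmax, so the snippet below is only a rough illustration of a hierarchy-aware loss of this kind in PyTorch, not the authors' implementation. The partitioning of the class axis into WordNet-derived levels (level_slices) and the per-level target encoding are assumptions made for the sake of a self-contained example.

```python
# Hedged sketch of a semantic-softmax-style loss (illustrative, not the paper's code).
# Assumptions: each class is assigned to one WordNet-derived hierarchy level, and
# every image carries one target index per level (-1 where no label exists at that level).

import torch
import torch.nn.functional as F

def semantic_softmax_loss(logits, level_slices, level_targets):
    """
    logits        : (batch, num_classes) raw classifier outputs over all classes.
    level_slices  : list of (start, end) column ranges, one per hierarchy level,
                    partitioning the class axis by semantic level.
    level_targets : (batch, num_levels) target index within each level,
                    with -1 marking "no label at this level".
    """
    total, count = logits.new_zeros(()), 0
    for level, (start, end) in enumerate(level_slices):
        targets = level_targets[:, level]
        valid = targets >= 0
        if valid.any():
            # A separate softmax / cross-entropy per hierarchy level,
            # applied only to samples that have a label at this level.
            total = total + F.cross_entropy(logits[valid, start:end], targets[valid])
            count += 1
    return total / max(count, 1)
```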

Results

Task | Dataset | Model | Metric | Value | Global Rank
Image Classification | CIFAR-100 | ViT-B-16 (ImageNet-21K-P pretrain) | Percentage correct | 94.2 | #4
Multi-Label Classification | MS-COCO | TResNet-L-V2 (ImageNet-21K-P pretrain, resolution 640) | mAP | 89.8 | #13
Multi-Label Classification | MS-COCO | TResNet-L-V2 (ImageNet-21K-P pretrain, resolution 448) | mAP | 88.4 | #15
Multi-Label Classification | PASCAL VOC 2007 | ViT-B-16 (ImageNet-21K pretrain) | mAP | 93.1 | #15
Image Classification | Stanford Cars | TResNet-L-V2 | Accuracy | 96.32 | #2
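The results above come from fine-tuning 21K-pretrained backbones on downstream datasets. As a hedged usage sketch only, one way to set up such a fine-tune with the timm library is shown below; the model name and pretrained-weight tag are illustrative, and the exact ImageNet-21K-P checkpoints are distributed via the authors' repository (https://github.com/Alibaba-MIIL/ImageNet21K) and recent timm releases.

```python
# Hedged fine-tuning sketch for a downstream task such as CIFAR-100.
# The checkpoint actually loaded by pretrained=True depends on the installed timm version.

import timm
import torch

# Replace the classification head with one sized for the downstream dataset (100 classes here).
model = timm.create_model('vit_base_patch16_224', pretrained=True, num_classes=100)

optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4, weight_decay=1e-4)
criterion = torch.nn.CrossEntropyLoss()

# ... standard training loop over the downstream dataset goes here ...
```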
