TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Long-tail Learning	CIFAR-100-LT (ρ=100)	RIDE+distill	Error Rate	50.9	# 31
Long-tail Learning	CIFAR-100-LT (ρ=100)	RIDE	Error Rate	52	# 34
Long-tail Learning	ImageNet-LT	RIDE (ResNeXt-50)	Top-1 Accuracy	56.4	# 29
Long-tail Learning	ImageNet-LT	RIDE (ResNet-50)	Top-1 Accuracy	54.9	# 35
Long-tail Learning	iNaturalist 2018	RIDE	Top-1 Accuracy	72.2%	# 22
Image Classification	iNaturalist 2018	RIDE (ResNet-50)	Top-1 Accuracy	72.2%	# 29

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-tailed-recognition-by-routing-diverse-1/long-tail-learning-on-inaturalist-2018)](https://paperswithcode.com/sota/long-tail-learning-on-inaturalist-2018?p=long-tailed-recognition-by-routing-diverse-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-tailed-recognition-by-routing-diverse-1/long-tail-learning-on-imagenet-lt)](https://paperswithcode.com/sota/long-tail-learning-on-imagenet-lt?p=long-tailed-recognition-by-routing-diverse-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-tailed-recognition-by-routing-diverse-1/image-classification-on-inaturalist-2018)](https://paperswithcode.com/sota/image-classification-on-inaturalist-2018?p=long-tailed-recognition-by-routing-diverse-1)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/long-tailed-recognition-by-routing-diverse-1/long-tail-learning-on-cifar-100-lt-r-100)](https://paperswithcode.com/sota/long-tail-learning-on-cifar-100-lt-r-100?p=long-tailed-recognition-by-routing-diverse-1)`
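
The four badges above share one URL pattern: a paper slug plus a benchmark slug. As a minimal sketch, the helper below (the name `pwc_badge` is hypothetical, not part of any Papers with Code tooling) rebuilds the badge Markdown from those two slugs. The pattern is inferred only from the badge lines listed here and may not hold for other pages.

```python
def pwc_badge(paper_slug: str, benchmark_slug: str) -> str:
    """Build badge Markdown following the URL pattern seen in the badges above.

    Both arguments are URL slugs; the pattern is inferred from this page only.
    """
    # Image URL: shields.io endpoint wrapping the Papers with Code badge JSON.
    badge = (
        "https://img.shields.io/endpoint.svg?url="
        f"https://paperswithcode.com/badge/{paper_slug}/{benchmark_slug}"
    )
    # Link target: the leaderboard page, with the paper passed as ?p=.
    target = f"https://paperswithcode.com/sota/{benchmark_slug}?p={paper_slug}"
    return f"[![PWC]({badge})]({target})"


if __name__ == "__main__":
    print(pwc_badge(
        "long-tailed-recognition-by-routing-diverse-1",
        "long-tail-learning-on-imagenet-lt",
    ))
```

Called with the slugs shown in the example, it reproduces the ImageNet-LT badge line from the table.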

Long-tailed Recognition by Routing Diverse Distribution-Aware Experts

ICLR 2021 · Xudong Wang, Long Lian, Zhongqi Miao, Ziwei Liu, Stella X. Yu

Natural data are often long-tail distributed over semantic classes. Existing recognition methods tackle this imbalanced classification by placing more emphasis on the tail data, through class re-balancing/re-weighting or ensembling over different data groups, resulting in increased tail accuracies but reduced head accuracies. We take a dynamic view of the training data and provide a principled model bias and variance analysis as the training data fluctuates: existing long-tail classifiers invariably increase the model variance, and the head-tail model bias gap remains large, due to more and larger confusion with hard negatives for the tail. We propose a new long-tailed classifier called RoutIng Diverse Experts (RIDE). It reduces the model variance with multiple experts, reduces the model bias with a distribution-aware diversity loss, and reduces the computational cost with a dynamic expert routing module. RIDE outperforms the state-of-the-art by 5% to 7% on CIFAR100-LT, ImageNet-LT and iNaturalist 2018 benchmarks. It is also a universal framework that is applicable to various backbone networks, long-tailed algorithms, and training mechanisms for consistent performance gains. Our code is available at: https://github.com/frank-xwang/RIDE-LongTailRecognition.
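
To make the "multiple experts plus dynamic routing" idea concrete, here is a toy sketch rather than the authors' implementation. It assumes each expert emits logits for one sample, that expert logits are combined by averaging, and that a fixed confidence threshold stands in for the routing module; the function name `route_experts` and the `threshold` parameter are hypothetical. The diversity loss is not modelled.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_experts(expert_logits, threshold=0.9):
    """Average expert logits one expert at a time and stop once confident.

    expert_logits: list of per-expert logit lists for ONE sample.
    threshold: hypothetical confidence cutoff (an assumption standing in
        for a learned router, which this sketch does not implement).
    Returns (predicted_class, experts_used).
    """
    num_classes = len(expert_logits[0])
    running_sum = [0.0] * num_classes
    for used, logits in enumerate(expert_logits, start=1):
        # Accumulate logits so the prediction is the mean over experts so far.
        running_sum = [s + l for s, l in zip(running_sum, logits)]
        probs = softmax([s / used for s in running_sum])
        confidence = max(probs)
        if confidence >= threshold:
            break  # confident enough: skip the remaining experts
    return probs.index(confidence), used


if __name__ == "__main__":
    easy = [[5.0, 0.0, 0.0], [4.0, 0.0, 0.0]]
    hard = [[1.0, 0.9, 0.0], [0.0, 3.0, 0.0], [0.0, 3.0, 0.0]]
    print(route_experts(easy))
    print(route_experts(hard))
```

In this toy setting the confident sample is resolved by the first expert alone, while the ambiguous one consults all three, which is the sense in which routing saves computation on easy inputs.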


Code


frank-xwang/RIDE-LongTailRecognition official

251

beierzhu/xerm

Tasks


Image Classification

Imbalanced Classification

Long-tail Learning

Datasets

ImageNet

CIFAR-100

iNaturalist

ImageNet-LT

CIFAR100-LT

Results from the Paper


Ranked #22 on Long-tail Learning on iNaturalist 2018


Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Long-tail Learning	CIFAR-100-LT (ρ=100)	RIDE+distill	Error Rate	50.9	# 31
Long-tail Learning	ImageNet-LT	RIDE (ResNeXt-50)	Top-1 Accuracy	56.4	# 29
Long-tail Learning	iNaturalist 2018	RIDE	Top-1 Accuracy	72.2%	# 22
Image Classification	iNaturalist 2018	RIDE (ResNet-50)	Top-1 Accuracy	72.2%	# 29

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank
Long-tail Learning	CIFAR-100-LT (ρ=100)	RIDE	Error Rate	52	# 34
Long-tail Learning	ImageNet-LT	RIDE (ResNet-50)	Top-1 Accuracy	54.9	# 35

