TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	DADA-seg	MobileNetV2	mIoU	16.05	# 27
Image Classification	ImageNet	MobileNetV2	Top 1 Accuracy	72%	# 929
Image Classification	ImageNet	MobileNetV2	Number of params	3.4M	# 372
Image Classification	ImageNet	MobileNetV2	GFLOPs	0.600	# 65
Image Classification	ImageNet	MobileNetV2 (1.4)	Top 1 Accuracy	74.7%	# 898
Image Classification	ImageNet	MobileNetV2 (1.4)	Number of params	6.9M	# 451
Image Classification	ImageNet	MobileNetV2 (1.4)	GFLOPs	1.170	# 112
Retinal OCT Disease Classification	OCT2017	MobileNet-v2	Acc	98.5	# 7
Retinal OCT Disease Classification	OCT2017	MobileNet-v2	Sensitivity	99.4	# 4
Retinal OCT Disease Classification	Srinivasan2014	MobileNet-v2	Acc	97.46	# 7

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilenetv2-inverted-residuals-and-linear/retinal-oct-disease-classification-on-oct2017)](https://paperswithcode.com/sota/retinal-oct-disease-classification-on-oct2017?p=mobilenetv2-inverted-residuals-and-linear)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilenetv2-inverted-residuals-and-linear/retinal-oct-disease-classification-on)](https://paperswithcode.com/sota/retinal-oct-disease-classification-on?p=mobilenetv2-inverted-residuals-and-linear)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilenetv2-inverted-residuals-and-linear/semantic-segmentation-on-dada-seg)](https://paperswithcode.com/sota/semantic-segmentation-on-dada-seg?p=mobilenetv2-inverted-residuals-and-linear)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilenetv2-inverted-residuals-and-linear/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=mobilenetv2-inverted-residuals-and-linear)`

MobileNetV2: Inverted Residuals and Linear Bottlenecks

CVPR 2018 · Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen

In this paper we describe a new mobile architecture, MobileNetV2, that improves the state-of-the-art performance of mobile models on multiple tasks and benchmarks as well as across a spectrum of different model sizes. We also describe efficient ways of applying these mobile models to object detection in a novel framework we call SSDLite. Additionally, we demonstrate how to build mobile semantic segmentation models through a reduced form of DeepLabv3 which we call Mobile DeepLabv3. The MobileNetV2 architecture is based on an inverted residual structure where the input and output of the residual block are thin bottleneck layers, opposite to traditional residual models, which use expanded representations in the input. MobileNetV2 uses lightweight depthwise convolutions to filter features in the intermediate expansion layer. Additionally, we find that it is important to remove non-linearities in the narrow layers in order to maintain representational power. We demonstrate that this improves performance and provide an intuition that led to this design. Finally, our approach allows decoupling of the input/output domains from the expressiveness of the transformation, which provides a convenient framework for further analysis. We measure our performance on ImageNet classification, COCO object detection, and VOC image segmentation. We evaluate the trade-offs between accuracy and number of operations measured by multiply-adds (MAdd), as well as the number of parameters.
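The block structure described in the abstract (a 1x1 expansion, a depthwise convolution in the expanded space, then a 1x1 linear projection back to a thin bottleneck) can be made concrete by counting its multiply-adds and weights. The sketch below is illustrative only: the names `BlockCost` and `inverted_residual_cost` are hypothetical, and it assumes "same" padding, bias-free convolutions, and ignores batch normalization. For stride 1 the multiply-add total reduces to h · w · t · d' · (d' + k² + d''), where d' and d'' are the input and output channel counts and t is the expansion factor.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BlockCost:
    madds: int   # multiply-adds for one forward pass of the block
    params: int  # convolution weights only (assumption: no biases, no batch-norm)


def inverted_residual_cost(h, w, c_in, c_out, t=6, k=3, stride=1):
    """Cost of: 1x1 expand -> kxk depthwise -> 1x1 linear projection.

    Assumes 'same' padding, so the depthwise stride alone sets the
    output resolution (rounded up).
    """
    hidden = c_in * t                 # channels in the expansion layer
    h_out = -(-h // stride)           # ceiling division
    w_out = -(-w // stride)

    expand = h * w * c_in * hidden                # 1x1 conv at input resolution
    depthwise = h_out * w_out * hidden * k * k    # one kxk filter per channel
    project = h_out * w_out * hidden * c_out      # 1x1 conv, no non-linearity after it

    params = c_in * hidden + hidden * k * k + hidden * c_out
    return BlockCost(madds=expand + depthwise + project, params=params)


# Example: a 56x56 feature map with 24 channels in and out, expansion factor 6.
cost = inverted_residual_cost(56, 56, c_in=24, c_out=24, t=6)
print(cost.madds, cost.params)
```

Because the depthwise term grows with k² rather than with k² times the channel count, most of the cost sits in the two 1x1 convolutions, which is why the expansion layer can be wide while the block stays cheap.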


Code

tensorflow/models

76,594

PaddlePaddle/PaddleOCR

38,458

See all 148 implementations

Tasks

Image Classification

Image Segmentation

Object Detection

Person Re-Identification

Retinal OCT Disease Classification

Semantic Segmentation

Datasets

ImageNet

MS COCO

ssd

DADA-seg

Results from the Paper

Ranked #7 on Retinal OCT Disease Classification on OCT2017

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Semantic Segmentation	DADA-seg	MobileNetV2	mIoU	16.05	# 27
Image Classification	ImageNet	MobileNetV2	Top 1 Accuracy	72%	# 929
Image Classification	ImageNet	MobileNetV2	Number of params	3.4M	# 372
Image Classification	ImageNet	MobileNetV2	GFLOPs	0.600	# 65
Image Classification	ImageNet	MobileNetV2 (1.4)	Top 1 Accuracy	74.7%	# 898
Image Classification	ImageNet	MobileNetV2 (1.4)	Number of params	6.9M	# 451
Image Classification	ImageNet	MobileNetV2 (1.4)	GFLOPs	1.170	# 112
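The two ImageNet entries above can be compared directly to see the accuracy/cost trade-off. The sketch below assumes the "(1.4)" suffix denotes a width multiplier of 1.4 applied to the channel counts; since most of the cost is in convolutions whose input and output widths both scale, cost should grow roughly with the square of that multiplier. The dictionary layout and the `compare` helper are hypothetical; the numbers are copied from the table.

```python
# Values copied from the ImageNet rows of the table above.
results = {
    "MobileNetV2":       {"top1": 72.0, "params_m": 3.4, "gflops": 0.600},
    "MobileNetV2 (1.4)": {"top1": 74.7, "params_m": 6.9, "gflops": 1.170},
}


def compare(base, wide):
    """Return the accuracy gain (percentage points) and the cost ratios."""
    return {
        "top1_gain": wide["top1"] - base["top1"],
        "params_ratio": wide["params_m"] / base["params_m"],
        "gflops_ratio": wide["gflops"] / base["gflops"],
    }


delta = compare(results["MobileNetV2"], results["MobileNetV2 (1.4)"])
width_sq = 1.4 ** 2  # assumed quadratic scaling of cost with width

for name, value in delta.items():
    print(f"{name}: {value:.2f}")
print(f"1.4 squared: {width_sq:.2f}")
```

Under that assumption, both cost ratios land close to 1.4² ≈ 1.96, while top-1 accuracy rises by 2.7 percentage points.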

Results from Other Papers

Task	Dataset	Model	Metric Name	Metric Value	Rank
Retinal OCT Disease Classification	OCT2017	MobileNet-v2	Acc	98.5	# 7
Retinal OCT Disease Classification	OCT2017	MobileNet-v2	Sensitivity	99.4	# 4
Retinal OCT Disease Classification	Srinivasan2014	MobileNet-v2	Acc	97.46	# 7

Methods

1x1 Convolution • ASPP • Average Pooling • Batch Normalization • Convolution • DeepLabv3 • Depthwise Convolution • Depthwise Separable Convolution • Dilated Convolution • Dropout • Inverted Residual Block • MobileNetV2 • Pointwise Convolution • ReLU • ReLU6 • Residual Block • Residual Connection • RMSProp • Spatial Pyramid Pooling • Weight Decay
