Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilestereonet-towards-lightweight-deep/stereo-depth-estimation-on-kitti2015)](https://paperswithcode.com/sota/stereo-depth-estimation-on-kitti2015?p=mobilestereonet-towards-lightweight-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mobilestereonet-towards-lightweight-deep/stereo-depth-estimation-on-sceneflow)](https://paperswithcode.com/sota/stereo-depth-estimation-on-sceneflow?p=mobilestereonet-towards-lightweight-deep)`

MobileStereoNet: Towards Lightweight Deep Networks for Stereo Matching

22 Aug 2021 · Faranak Shamsafar, Samuel Woerz, Rafia Rahim, Andreas Zell

Recent methods in stereo matching have continuously improved accuracy using deep models. This gain, however, comes with a large increase in computation cost, such that the network may not fit even on a moderate GPU. This becomes a problem when the model needs to be deployed on resource-limited devices. To address this, we propose two lightweight models for stereo vision with reduced complexity and without sacrificing accuracy. Depending on the dimension of the cost volume, we design a 2D and a 3D model with encoder-decoders built from 2D and 3D convolutions, respectively. To this end, we leverage 2D MobileNet blocks and extend them to 3D for stereo vision applications. In addition, a new cost volume is proposed to boost the accuracy of the 2D model, so that it performs close to 3D networks. Experiments show that the proposed 2D/3D networks effectively reduce the computational expense (27%/95% and 72%/38% fewer parameters/operations in 2D and 3D models, respectively) while maintaining accuracy. Our code is available at https://github.com/cogsys-tuebingen/mobilestereonet.
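
The savings quoted in the abstract are network-level figures. As a rough illustration of where savings of this kind can come from, the sketch below counts the weights of a single standard convolution and of its depthwise separable counterpart (the MobileNet building block) for 2D and 3D kernels. The channel counts and kernel size are hypothetical values chosen only for the arithmetic; they are not layer sizes taken from MobileStereoNet, and the function names are made up for this example.

```python
def conv_params(c_in, c_out, kernel, dims):
    """Weights in a standard convolution with a square/cubic kernel (bias ignored)."""
    return c_in * c_out * kernel ** dims


def separable_conv_params(c_in, c_out, kernel, dims):
    """Weights in a depthwise separable convolution (bias ignored)."""
    depthwise = c_in * kernel ** dims  # one spatial filter per input channel
    pointwise = c_in * c_out           # 1x1 (or 1x1x1) convolution mixing channels
    return depthwise + pointwise


def reduction(c_in, c_out, kernel, dims):
    """Fraction of weights saved by the separable variant for one layer."""
    standard = conv_params(c_in, c_out, kernel, dims)
    separable = separable_conv_params(c_in, c_out, kernel, dims)
    return 1.0 - separable / standard


if __name__ == "__main__":
    # Hypothetical layer: 32 input channels, 32 output channels, kernel size 3.
    for dims in (2, 3):
        standard = conv_params(32, 32, 3, dims)
        separable = separable_conv_params(32, 32, 3, dims)
        saved = reduction(32, 32, 3, dims)
        print(f"{dims}D: standard={standard}, separable={separable}, saved={saved:.1%}")
```

For this hypothetical layer the standard convolution has 9216 weights in 2D and 27648 in 3D, against 1312 and 1888 for the separable version. The per-layer saving grows with kernel dimensionality, which suggests why extending MobileNet blocks to 3D can be attractive; whole-network savings depend on the full architecture and will differ from these single-layer numbers.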

Code

cogsys-tuebingen/mobilestereonet official

213

ibaiGorordo/ONNX-MobileStereoNet

ibaiGorordo/TFLite-MobileStereoNet

Tasks

Depth Estimation

Disparity Estimation

Stereo Depth Estimation

Stereo Matching

Datasets

KITTI

Results from the Paper

Ranked #1 on Stereo Depth Estimation on sceneflow

Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Stereo Depth Estimation	KITTI2015	2D-MobileStereoNet	three pixel error	2.67	# 3
Stereo Depth Estimation	KITTI2015	3D-MobileStereoNet	three pixel error	1.69	# 1
Stereo Depth Estimation	sceneflow	3D-MobileStereoNet	Average End-Point Error	0.80	# 1
Stereo Depth Estimation	sceneflow	3D-MobileStereoNet	EPE	0.80	# 1
Stereo Depth Estimation	sceneflow	2D-MobileStereoNet	Average End-Point Error	1.14	# 3
Stereo Depth Estimation	sceneflow	2D-MobileStereoNet	EPE	1.14	# 2
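
For readers unfamiliar with the two metrics in the table, the sketch below shows one common way to compute them from predicted and ground-truth disparities. It assumes that end-point error (EPE) is the mean absolute disparity difference in pixels and that the three pixel error is the percentage of valid pixels whose absolute error exceeds 3 px. The KITTI 2015 benchmark additionally requires the error to exceed 5% of the true disparity before a pixel counts as wrong; this page does not state which variant the listed values use, so the relative test is left out here. Function names and sample values are hypothetical.

```python
def _valid_errors(pred, gt, valid=None):
    """Absolute disparity errors for the pixels marked valid (all pixels by default)."""
    if len(pred) != len(gt):
        raise ValueError("pred and gt must have the same length")
    if valid is None:
        valid = [True] * len(gt)
    errors = [abs(p - g) for p, g, v in zip(pred, gt, valid) if v]
    if not errors:
        raise ValueError("no valid pixels to evaluate")
    return errors


def end_point_error(pred, gt, valid=None):
    """Mean absolute disparity error in pixels over valid pixels."""
    errors = _valid_errors(pred, gt, valid)
    return sum(errors) / len(errors)


def pixel_error_rate(pred, gt, threshold=3.0, valid=None):
    """Percentage of valid pixels whose absolute error is strictly above threshold."""
    errors = _valid_errors(pred, gt, valid)
    wrong = sum(1 for e in errors if e > threshold)
    return 100.0 * wrong / len(errors)


if __name__ == "__main__":
    # Hypothetical flattened disparity maps, in pixels.
    pred = [10.0, 20.5, 33.0, 47.0]
    gt = [10.0, 20.0, 30.0, 40.0]
    print("EPE:", end_point_error(pred, gt))
    print("three pixel error (%):", pixel_error_rate(pred, gt))
```

The optional `valid` mask reflects that ground-truth disparity is often missing for some pixels, which are then excluded from both averages.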

Methods

1x1 Convolution • Average Pooling • Batch Normalization • Convolution • Dense Connections • Depthwise Convolution • Depthwise Separable Convolution • Global Average Pooling • Inverted Residual Block • MobileNetV1 • MobileNetV2 • Pointwise Convolution • ReLU • Softmax
