TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
3D Object Detection	KITTI Cars Easy val	PVCNN	AP	84.02	# 7
3D Object Detection	KITTI Cars Hard val	PVCNN	AP	63.81	# 7
3D Object Detection	KITTI Cars Moderate val	PVCNN	AP	71.54	# 8
3D Object Detection	KITTI Cyclist Easy val	PVCNN	AP	81.4	# 2
3D Object Detection	KITTI Cyclist Hard val	PVCNN	AP	56.24	# 2
3D Object Detection	KITTI Cyclist Moderate val	PVCNN	AP	59.97	# 2
3D Object Detection	KITTI Pedestrian Easy val	PVCNN	AP	73.2	# 1
3D Object Detection	KITTI Pedestrian Hard val	PVCNN	AP	56.78	# 1
3D Object Detection	KITTI Pedestrian Moderate val	PVCNN	AP	64.71	# 1
3D Semantic Segmentation	S3DIS	PVCNN++	mIoU (6-Fold)	58.98	# 5
3D Part Segmentation	ShapeNet-Part	PVCNN volumetric	Instance Average IoU	86.2	# 29
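
The mIoU and IoU values in the table above are intersection-over-union scores averaged over classes. The sketch below shows a generic mean-IoU computation over per-point labels. The function name `mean_iou`, the toy labels, and the choice to skip classes absent from both prediction and ground truth are assumptions for illustration. The exact protocols behind the reported numbers (6-fold averaging on S3DIS, instance averaging on ShapeNet-Part) are not described on this page and may differ.

```python
def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes for per-point labels.

    pred, target: equal-length sequences of integer class ids.
    Classes that appear in neither sequence are skipped (an assumption;
    benchmarks differ on how they treat absent classes).
    """
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    # Guard against the degenerate case where no class is present at all.
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: class 0 has IoU 1/2, class 1 has IoU 2/3.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
print(round(100 * score, 2))
```

Multiplying by 100 puts the score on the same percentage scale as the table.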

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-pedestrian-easy)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-pedestrian-easy?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-pedestrian-hard)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-pedestrian-hard?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-pedestrian)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-pedestrian?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-cyclist-easy-val)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-cyclist-easy-val?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-cyclist-hard-val)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-cyclist-hard-val?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-cyclist-moderate)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-cyclist-moderate?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-semantic-segmentation-on-s3dis)](https://paperswithcode.com/sota/3d-semantic-segmentation-on-s3dis?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-cars-easy-val)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-cars-easy-val?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-cars-hard-val)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-cars-hard-val?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-object-detection-on-kitti-cars-moderate-1)](https://paperswithcode.com/sota/3d-object-detection-on-kitti-cars-moderate-1?p=point-voxel-cnn-for-efficient-3d-deep)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/point-voxel-cnn-for-efficient-3d-deep/3d-part-segmentation-on-shapenet-part)](https://paperswithcode.com/sota/3d-part-segmentation-on-shapenet-part?p=point-voxel-cnn-for-efficient-3d-deep)`

Point-Voxel CNN for Efficient 3D Deep Learning

NeurIPS 2019 · Zhijian Liu, Haotian Tang, Yujun Lin, Song Han

We present Point-Voxel CNN (PVCNN) for efficient, fast 3D deep learning. Previous work processes 3D data using either voxel-based or point-based neural network models. However, both approaches are computationally inefficient. The computation cost and memory footprint of voxel-based models grow cubically with the input resolution, making it memory-prohibitive to scale up the resolution. As for point-based networks, up to 80% of the time is spent structuring the sparse data, which has rather poor memory locality, rather than on the actual feature extraction. In this paper, we propose PVCNN, which represents the 3D input data in points to reduce memory consumption, while performing the convolutions in voxels to reduce irregular, sparse data access and improve locality. Our PVCNN model is both memory and computation efficient. Evaluated on semantic and part segmentation datasets, it achieves much higher accuracy than the voxel-based baseline with 10x GPU memory reduction; it also outperforms the state-of-the-art point-based models with 7x measured speedup on average. Remarkably, the narrower version of PVCNN achieves 2x speedup over PointNet (an extremely efficient model) on part and scene segmentation benchmarks with much higher accuracy. We validate the general effectiveness of PVCNN on 3D object detection: by replacing the primitives in Frustum PointNet with PVConv, it outperforms Frustum PointNet++ by 2.4% mAP on average with 1.5x measured speedup and GPU memory reduction.
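
To make the point-voxel idea in the abstract concrete, here is a minimal pure-Python sketch: features stay attached to points, are averaged into a coarse voxel grid for neighborhood aggregation, and are then read back at each point and fused with a per-point branch. Everything specific here is an illustrative assumption rather than the authors' implementation: the function names, the mean-pooling voxelization, the 3x3x3 box filter standing in for a learned 3D convolution, the nearest-voxel read-back, the scalar `point_scale` standing in for a per-point MLP, and the additive fusion.

```python
from collections import defaultdict


def voxelize(coords, feats, resolution):
    """Average the features of all points that fall into the same voxel.

    coords: (x, y, z) tuples, assumed already normalized to [0, 1).
    Returns a sparse grid {voxel index: mean feature} and each point's voxel index.
    """
    sums, counts, point_voxel = {}, defaultdict(int), []
    for xyz, f in zip(coords, feats):
        idx = tuple(min(int(c * resolution), resolution - 1) for c in xyz)
        point_voxel.append(idx)
        prev = sums.get(idx, [0.0] * len(f))
        sums[idx] = [s + v for s, v in zip(prev, f)]
        counts[idx] += 1
    grid = {idx: [s / counts[idx] for s in total] for idx, total in sums.items()}
    return grid, point_voxel


def box_filter(grid, channels):
    """Stand-in for a learned 3D convolution: mean over occupied 3x3x3 neighbors."""
    out = {}
    for (i, j, k) in grid:
        acc, n = [0.0] * channels, 0
        for di in (-1, 0, 1):
            for dj in (-1, 0, 1):
                for dk in (-1, 0, 1):
                    nb = grid.get((i + di, j + dj, k + dk))
                    if nb is not None:
                        acc = [a + b for a, b in zip(acc, nb)]
                        n += 1
        out[(i, j, k)] = [a / n for a in acc]  # n >= 1: the voxel itself is occupied
    return out


def pvconv_sketch(coords, feats, resolution, point_scale=1.0):
    """Fuse a voxel branch (regular, local access) with a point branch (fine detail)."""
    grid, point_voxel = voxelize(coords, feats, resolution)
    smoothed = box_filter(grid, len(feats[0]))
    fused = []
    for f, idx in zip(feats, point_voxel):
        voxel_f = smoothed[idx]                  # nearest-voxel read-back
        point_f = [point_scale * v for v in f]   # placeholder for a per-point MLP
        fused.append([a + b for a, b in zip(voxel_f, point_f)])
    return fused


coords = [(0.1, 0.1, 0.1), (0.15, 0.1, 0.1), (0.9, 0.9, 0.9)]
feats = [[1.0], [3.0], [10.0]]
print(pvconv_sketch(coords, feats, resolution=4))  # [[3.0], [5.0], [20.0]]
```

The grid is stored sparsely as a dictionary only to keep the example short. A dense grid at resolution r would hold r^3 cells per channel, which is the cubic growth the abstract refers to.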


Code


mit-han-lab/pvcnn (official), 617 stars

isl-org/Open3D-ML, 1,660 stars

zghera/voxel-tf-ops

zghera/pvcnn-tf

Tasks


3D Object Detection

3D Semantic Segmentation

object-detection

Object Detection

Scene Segmentation

Datasets

KITTI

ShapeNet

S3DIS

Results from the Paper


Ranked #1 on 3D Object Detection on KITTI Pedestrian Hard val



Methods


PointNet
