PatchFormer: An Efficient Point Transformer with Patch Attention

CVPR 2022 · Zhang Cheng, Haocheng Wan, Xinyi Shen, Zizhao Wu

The point cloud learning community has witnessed a modeling shift from CNNs to Transformers, where pure Transformer architectures have achieved top accuracy on the major learning benchmarks. However, existing point Transformers are computationally expensive because they need to generate a large attention map, which has quadratic complexity (in both space and time) with respect to the input size. To address this shortcoming, we introduce Patch ATtention (PAT), which adaptively learns a much smaller set of bases upon which the attention maps are computed. Through a weighted summation over these bases, PAT not only captures the global shape context but also achieves linear complexity with respect to the input size. In addition, we propose a lightweight Multi-Scale aTtention (MST) block that builds attention among features of different scales, providing the model with multi-scale features. Equipped with PAT and MST, we construct our neural architecture, PatchFormer, which integrates both modules into a joint framework for point cloud learning. Extensive experiments demonstrate that our network achieves comparable accuracy on general point cloud learning tasks with a 9.2x speed-up over previous point Transformers.
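To make the complexity argument concrete, the sketch below shows one way attention over a small set of learned bases can be written in PyTorch. It is a minimal illustration reconstructed from the abstract alone, not the authors' implementation: the module name `PatchAttentionSketch`, the choice of 32 bases, and the softmax-over-points aggregation are all assumptions.

```python
# Minimal sketch of linear-complexity attention over M learned bases (M << N).
# Assumption-laden illustration of the idea in the abstract, not PatchFormer's PAT.
import torch
import torch.nn as nn
import torch.nn.functional as F


class PatchAttentionSketch(nn.Module):
    """Attend N point features to M learned bases: O(N*M) instead of O(N^2)."""

    def __init__(self, dim: int, num_bases: int = 32):
        super().__init__()
        self.to_basis_logits = nn.Linear(dim, num_bases)  # soft assignment of points to bases
        self.to_q = nn.Linear(dim, dim)
        self.to_k = nn.Linear(dim, dim)
        self.to_v = nn.Linear(dim, dim)
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (B, N, C) per-point features
        B, N, C = x.shape

        # Adaptively aggregate the N points into M bases (softmax over the point axis).
        assign = F.softmax(self.to_basis_logits(x), dim=1)          # (B, N, M)
        bases = torch.einsum('bnm,bnc->bmc', assign, x)             # (B, M, C)

        # Scaled dot-product attention where keys/values are the M bases only.
        q = self.to_q(x)                                            # (B, N, C)
        k = self.to_k(bases)                                        # (B, M, C)
        v = self.to_v(bases)                                        # (B, M, C)
        attn = F.softmax(q @ k.transpose(1, 2) / C ** 0.5, dim=-1)  # (B, N, M)
        out = attn @ v                                              # (B, N, C)
        return self.proj(out)


if __name__ == "__main__":
    points = torch.randn(2, 1024, 64)               # 2 clouds, 1024 points, 64-d features
    print(PatchAttentionSketch(64)(points).shape)   # torch.Size([2, 1024, 64])
```

Because the attention map is N x M rather than N x N, memory and compute grow linearly with the number of points for a fixed number of bases, which is the mechanism behind the reported speed-up.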


Results from the Paper


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Semantic Segmentation | S3DIS Area5 | PatchFormer | mIoU | 67.3 | #32 |
| Semantic Segmentation | S3DIS Area5 | PatchFormer | Number of params | N/A | #2 |
| Semantic Segmentation | ShapeNet | PatchFormer | Mean IoU | 86.5% | #1 |
