2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds

10 Jul 2022  ·  Xu Yan, Jiantao Gao, Chaoda Zheng, Chao Zheng, Ruimao Zhang, Shenghui Cui, Zhen Li ·

As camera and LiDAR sensors capture complementary information used in autonomous driving, great efforts have been made to develop semantic segmentation algorithms through multi-modality data fusion. However, fusion-based approaches require paired data, i.e., LiDAR point clouds and camera images with strict point-to-pixel mappings, as the inputs in both training and inference, which seriously hinders their application in practical scenarios. Thus, in this work, we propose the 2D Priors Assisted Semantic Segmentation (2DPASS), a general training scheme, to boost the representation learning on point clouds, by fully taking advantage of 2D images with rich appearance. In practice, by leveraging an auxiliary modal fusion and multi-scale fusion-to-single knowledge distillation (MSFSKD), 2DPASS acquires richer semantic and structural information from the multi-modal data, which are then online distilled to the pure 3D network. As a result, equipped with 2DPASS, our baseline shows significant improvement with only point cloud inputs. Specifically, it achieves the state-of-the-arts on two large-scale benchmarks (i.e. SemanticKITTI and NuScenes), including top-1 results in both single and multiple scan(s) competitions of SemanticKITTI.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
LIDAR Semantic Segmentation nuScenes 2DPASS test mIoU 0.81 # 6
Robust 3D Semantic Segmentation nuScenes-C 2DPASS mean Corruption Error (mCE) 98.56% # 4
3D Semantic Segmentation SemanticKITTI 2DPASS test mIoU 72.9% # 6
val mIoU 69.3% # 7
Robust 3D Semantic Segmentation SemanticKITTI-C 2DPASS mean Corruption Error (mCE) 106.14% # 11

Methods