TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth)	Validation mIoU	39.19	# 2
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth)	Test mIoU	35.92	# 1
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth+Normal)	Validation mIoU	39.26	# 1
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth+Normal)	Test mIoU	35.52	# 3
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Normal)	Validation mIoU	38.91	# 3
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Normal)	Test mIoU	35.77	# 2
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB Only)	Validation mIoU	35.15	# 4
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB Only)	Test mIoU	31.3	# 4
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB Only)	mIoU	52.87%	# 9
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB Only)	mAcc	63.96	# 9
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth+Normal)	mIoU	59.43%	# 2
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth+Normal)	mAcc	69.03	# 2
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+HHA)	mIoU	60.6%	# 1
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+HHA)	mAcc	70.68	# 1
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Normal)	mIoU	58.24%	# 3
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Normal)	mAcc	68.79	# 3
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth)	mIoU	55.49%	# 5
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth)	mAcc	68.57	# 4
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Normal)	Validation mIoU	74.38	# 2
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Normal)	Test mIoU	71	# 2
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth)	Validation mIoU	73.78	# 3
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth)	Test mIoU	70.17	# 3
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB Only)	Validation mIoU	71.94	# 4
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB Only)	Test mIoU	68.34	# 4
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth+Normal)	Validation mIoU	75.86	# 1
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth+Normal)	Test mIoU	71.97	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/single-frame-semantic-segmentation-using/semantic-segmentation-on-matterport3d)](https://paperswithcode.com/sota/semantic-segmentation-on-matterport3d?p=single-frame-semantic-segmentation-using)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/single-frame-semantic-segmentation-using/semantic-segmentation-on-stanford2d3d-1)](https://paperswithcode.com/sota/semantic-segmentation-on-stanford2d3d-1?p=single-frame-semantic-segmentation-using)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/single-frame-semantic-segmentation-using/semantic-segmentation-on-structured3d)](https://paperswithcode.com/sota/semantic-segmentation-on-structured3d?p=single-frame-semantic-segmentation-using)`

Single Frame Semantic Segmentation Using Multi-Modal Spherical Images

18 Aug 2023 · Suresh Guttikonda, Jason Rambach ·

In recent years, the research community has shown a lot of interest to panoramic images that offer a 360-degree directional perspective. Multiple data modalities can be fed, and complimentary characteristics can be utilized for more robust and rich scene interpretation based on semantic segmentation, to fully realize the potential. Existing research, however, mostly concentrated on pinhole RGB-X semantic segmentation. In this study, we propose a transformer-based cross-modal fusion architecture to bridge the gap between multi-modal fusion and omnidirectional scene perception. We employ distortion-aware modules to address extreme object deformations and panorama distortions that result from equirectangular representation. Additionally, we conduct cross-modal interactions for feature rectification and information exchange before merging the features in order to communicate long-range contexts for bi-modal and tri-modal feature streams. In thorough tests using combinations of four different modality types in three indoor panoramic-view datasets, our technique achieved state-of-the-art mIoU performance: 60.60% on Stanford2D3DS (RGB-HHA), 71.97% Structured3D (RGB-D-N), and 35.92% Matterport3D (RGB-D). We plan to release all codes and trained models soon.

PDF Abstract

Code

Add Remove Mark official

sguttikon/SFSS-MMSI official

Tasks

Add Remove

Semantic Segmentation

Datasets

Matterport3D

2D-3D-S

Structured3D

Results from the Paper

Edit

Ranked #1 on Semantic Segmentation on Matterport3D

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth)	Validation mIoU	39.19	# 2	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth)	Test mIoU	35.92	# 1	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth+Normal)	Validation mIoU	39.26	# 1	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Depth+Normal)	Test mIoU	35.52	# 3	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Normal)	Validation mIoU	38.91	# 3	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB+Normal)	Test mIoU	35.77	# 2	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB Only)	Validation mIoU	35.15	# 4	Compare
Semantic Segmentation	Matterport3D	SFSS-MMSI (RGB Only)	Test mIoU	31.3	# 4	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB Only)	mIoU	52.87%	# 9	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB Only)	mAcc	63.96	# 9	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth+Normal)	mIoU	59.43%	# 2	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth+Normal)	mAcc	69.03	# 2	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+HHA)	mIoU	60.6%	# 1	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+HHA)	mAcc	70.68	# 1	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Normal)	mIoU	58.24%	# 3	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Normal)	mAcc	68.79	# 3	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth)	mIoU	55.49%	# 5	Compare
Semantic Segmentation	Stanford2D3D Panoramic	SFSS-MMSI (RGB+Depth)	mAcc	68.57	# 4	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Normal)	Validation mIoU	74.38	# 2	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Normal)	Test mIoU	71	# 2	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth)	Validation mIoU	73.78	# 3	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth)	Test mIoU	70.17	# 3	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB Only)	Validation mIoU	71.94	# 4	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB Only)	Test mIoU	68.34	# 4	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth+Normal)	Validation mIoU	75.86	# 1	Compare
Semantic Segmentation	Structured3D	SFSS-MMSI (RGB+Depth+Normal)	Test mIoU	71.97	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Single Frame Semantic Segmentation Using Multi-Modal Spherical Images

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove