PSANet

Introduced by Zhao et al. in PSANet: Point-wise Spatial Attention Network for Scene Parsing

PSANet is a semantic segmentation architecture that utilizes a Point-wise Spatial Attention (PSA) module to aggregate long-range contextual information in a flexible and adaptive manner. Each position in the feature map is connected with all other ones through self-adaptively predicted attention maps, thus harvesting various information nearby and far away. Furthermore, the authors design the bi-directional information propagation path for a comprehensive understanding of complex scenes. Each position collects information from all others to help the prediction of itself and vice versa, the information at each position can be distributed globally, assisting the prediction of all other positions. Finally, the bi-directionally aggregated contextual information is fused with local features to form the final representation of complex scenes.

The authors use ResNet as an FCN backbone for PSANet, as the Figure to the right illustrates. The proposed PSA module is then used to aggregate long-range contextual information from the local representation. It follows stage-5 in ResNet, which is the final stage of the FCN backbone. Features in stage-5 are semantically stronger. Aggregating them together leads to a more comprehensive representation of long-range context. Moreover, the spatial size of the feature map at stage-5 is smaller and can reduce computation overhead and memory consumption. An auxiliary loss branch is applied apart from the main loss.

Source: PSANet: Point-wise Spatial Attention Network for Scene Parsing

Read Paper See Code

Papers

Paper	Code	Results	Date	Stars

Tasks

Task	Papers	Share
Scene Parsing	1	50.00%
Semantic Segmentation	1	50.00%

Usage Over Time

This feature is experimental; we are continuously improving our matching algorithm.

Components

Component	Type	Add Remove
Auxiliary Classifier	Miscellaneous Components
Point-wise Spatial Attention	Semantic Segmentation Modules
ResNet	Convolutional Neural Networks
SyncBN	Normalization

Categories

Add Remove

Semantic Segmentation Models