TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Image Classification	ImageNet	SGE-ResNet101	Top 1 Accuracy	78.798%	# 742
Image Classification	ImageNet	SGE-ResNet101	Number of params	44.55M	# 701
Image Classification	ImageNet	SGE-ResNet101	GFLOPs	7.858	# 264
Image Classification	ImageNet	SGE-ResNet50	Top 1 Accuracy	77.584%	# 803
Image Classification	ImageNet	SGE-ResNet50	Number of params	25.56M	# 597
Image Classification	ImageNet	SGE-ResNet50	GFLOPs	4.127	# 200

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/spatial-group-wise-enhance-improving-semantic/image-classification-on-imagenet)](https://paperswithcode.com/sota/image-classification-on-imagenet?p=spatial-group-wise-enhance-improving-semantic)`

Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks

23 May 2019 · Xiang Li, Xiaolin Hu, Jian Yang ·

The Convolutional Neural Networks (CNNs) generate the feature representation of complex objects by collecting hierarchical and different parts of semantic sub-features. These sub-features can usually be distributed in grouped form in the feature vector of each layer, representing various semantic entities. However, the activation of these sub-features is often spatially affected by similar patterns and noisy backgrounds, resulting in erroneous localization and identification. We propose a Spatial Group-wise Enhance (SGE) module that can adjust the importance of each sub-feature by generating an attention factor for each spatial location in each semantic group, so that every individual group can autonomously enhance its learnt expression and suppress possible noise. The attention factors are only guided by the similarities between the global and local feature descriptors inside each group, thus the design of SGE module is extremely lightweight with \emph{almost no extra parameters and calculations}. Despite being trained with only category supervisions, the SGE component is extremely effective in highlighting multiple active areas with various high-order semantics (such as the dog's eyes, nose, etc.). When integrated with popular CNN backbones, SGE can significantly boost the performance of image recognition tasks. Specifically, based on ResNet50 backbones, SGE achieves 1.2\% Top-1 accuracy improvement on the ImageNet benchmark and 1.0$\sim$2.0\% AP gain on the COCO benchmark across a wide range of detectors (Faster/Mask/Cascade RCNN and RetinaNet). Codes and pretrained models are available at https://github.com/implus/PytorchInsight.

PDF Abstract

Code

Add Remove Mark official

implus/PytorchInsight official

857

mindspore-courses/External-Attentio…

whai362/PytorchInsight

Tasks

Add Remove

Image Classification

Object Detection

Datasets

ImageNet

MS COCO

Results from the Paper

Edit

Ranked #739 on Image Classification on ImageNet

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Image Classification	ImageNet	SGE-ResNet101	Top 1 Accuracy	78.798%	# 742	Compare
			Number of params	44.55M	# 701	Compare
			GFLOPs	7.858	# 264	Compare
Image Classification	ImageNet	SGE-ResNet50	Top 1 Accuracy	77.584%	# 803	Compare
			Number of params	25.56M	# 597	Compare
			GFLOPs	4.127	# 200	Compare

Methods

Add Remove

1x1 Convolution • Average Pooling • Batch Normalization • Bottleneck Residual Block • Cascade R-CNN • Convolution • Dot-Product Attention • Faster R-CNN • Focal Loss • FPN • Global Average Pooling • Kaiming Initialization • Mask R-CNN • Max Pooling • Random Horizontal Flip • Random Resized Crop • ReLU • Residual Block • Residual Connection • ResNet • RetinaNet • RoIAlign • RoIPool • RPN • SGD with Momentum • Sigmoid Activation • Softmax • Spatial Group-wise Enhance • Step Decay • Weight Decay

Edit Social Preview

Spatial Group-wise Enhance: Improving Semantic Feature Learning in Convolutional Networks

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove