Very Deep Convolutional Networks for Large-Scale Image Recognition

4 Sep 2014  ·  Karen Simonyan, Andrew Zisserman ·

In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Face Anti-Spoofing CelebA-Spoof-Enroll5 VGG16 AUC 98.0 # 3
Image Classification ImageNet ReaL VGG-16 Accuracy 79.01% # 52
Image Classification ImageNet ReaL VGG-16 BN Accuracy 80.60% # 51
Classification InDL VGG16 Average Recall 92.86% # 2
Face Anti-Spoofing SiW-Enroll5 VGG16 AUC 97.8 # 5
Domain Generalization VizWiz-Classification VGG-11 BN Accuracy - All Images 32.9 # 80
Accuracy - Corrupted Images 25.8 # 81
Accuracy - Clean Images 37.1 # 80
Domain Generalization VizWiz-Classification VGG-13 BN Accuracy - All Images 33.7 # 78
Accuracy - Corrupted Images 28.3 # 72
Accuracy - Clean Images 38.4 # 78
Domain Generalization VizWiz-Classification VGG-19 Accuracy - All Images 34.7 # 73
Accuracy - Corrupted Images 29 # 68
Accuracy - Clean Images 39.3 # 74
Domain Generalization VizWiz-Classification VGG-16 Accuracy - All Images 34.7 # 73
Accuracy - Corrupted Images 28.5 # 70
Accuracy - Clean Images 39.5 # 71
Domain Generalization VizWiz-Classification VGG-19 BN Accuracy - All Images 36.2 # 62
Accuracy - Corrupted Images 29.4 # 66
Accuracy - Clean Images 40.8 # 61
Domain Generalization VizWiz-Classification VGG-16 BN Accuracy - All Images 36.7 # 57
Accuracy - Corrupted Images 31.1 # 52
Accuracy - Clean Images 41.1 # 59
Domain Generalization VizWiz-Classification VGG-13 Accuracy - All Images 32.4 # 82
Accuracy - Corrupted Images 26.4 # 80
Accuracy - Clean Images 36.5 # 82
Domain Generalization VizWiz-Classification VGG-11 Accuracy - All Images 31.5 # 83
Accuracy - Corrupted Images 25.2 # 82
Accuracy - Clean Images 36.1 # 83
Classification XImageNet-12 VGG-16 Robustness Score 0.8845 # 3

Results from Other Papers


Task Dataset Model Metric Name Metric Value Rank Source Paper Compare
Image-to-Image Translation GTAV-to-Cityscapes Labels VGG16 60.3 mIoU 41.3 # 21

Methods