Training Neural Networks with Local Error Signals

20 Jan 2019 · Arild Nøkland, Lars Hiller Eidnes

Supervised training of neural networks for classification is typically performed with a global loss function. The loss function provides a gradient for the output layer, and this gradient is back-propagated to hidden layers to dictate an update direction for the weights. An alternative approach is to train the network with layer-wise loss functions. In this paper we demonstrate, for the first time, that layer-wise training can approach the state of the art on a variety of image datasets. We use single-layer sub-networks and two different supervised loss functions to generate local error signals for the hidden layers, and we show that the combination of these losses helps with optimization in the context of local learning. Using local errors could be a step towards more biologically plausible deep learning because the global error does not have to be transported back to hidden layers. A completely backprop-free variant outperforms previously reported results among methods aiming for higher biological plausibility. Code is available at https://github.com/anokland/local-loss.
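
The local-learning idea described above trains each hidden layer with its own error signal: an auxiliary linear classifier supplies a supervised prediction (cross-entropy) loss, a similarity-matching loss compares pairwise activation similarities to pairwise label similarities, and activations are detached before being passed to the next layer so no gradient is back-propagated through earlier layers. The following is a minimal PyTorch sketch of this idea; the block structure, hidden sizes, and the loss weighting `beta` are illustrative assumptions rather than the paper's exact configuration (see the repository linked above for the authors' implementation).

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LocalLossBlock(nn.Module):
    """One hidden layer trained with its own local error signal.

    Combines a local cross-entropy "prediction" loss from an auxiliary
    linear classifier with a "similarity matching" loss between pairwise
    activation similarities and pairwise label similarities. The weighting
    `beta` and the layer sizes are illustrative, not the paper's settings.
    """
    def __init__(self, in_dim, out_dim, num_classes, beta=0.99):
        super().__init__()
        self.layer = nn.Sequential(nn.Linear(in_dim, out_dim), nn.ReLU())
        self.aux_classifier = nn.Linear(out_dim, num_classes)  # local prediction head
        self.beta = beta

    @staticmethod
    def _similarity_matrix(x):
        # Cosine-similarity matrix between all pairs of examples in the batch.
        x = x.flatten(1)
        x = x - x.mean(dim=1, keepdim=True)
        x = F.normalize(x, dim=1)
        return x @ x.t()

    def forward(self, x, y_onehot):
        h = self.layer(x)
        # Local prediction loss: cross-entropy on the auxiliary classifier.
        pred_loss = F.cross_entropy(self.aux_classifier(h), y_onehot.argmax(dim=1))
        # Local similarity-matching loss: match activation similarities to label similarities.
        sim_loss = F.mse_loss(self._similarity_matrix(h), self._similarity_matrix(y_onehot))
        local_loss = (1 - self.beta) * pred_loss + self.beta * sim_loss
        # Detach so no gradient flows to earlier layers: the error signal stays local.
        return h.detach(), local_loss


# Usage sketch: each block is updated from its own local loss only.
block = LocalLossBlock(in_dim=784, out_dim=1024, num_classes=10)
opt = torch.optim.Adam(block.parameters(), lr=5e-4)
x, y = torch.randn(32, 784), torch.randint(0, 10, (32,))
h, local_loss = block(x, F.one_hot(y, 10).float())
opt.zero_grad(); local_loss.backward(); opt.step()  # h is detached, so no global backprop
```

Because each block returns a detached activation, an optimizer attached to layer ℓ steps only on that layer's local loss, and updates never depend on errors computed deeper in the network.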

| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Image Classification | CIFAR-10 | VGG11B(2x) + LocalLearning + CO | Percentage correct | 96.4 | #106 |
| Image Classification | CIFAR-100 | VGG11B(3x) + LocalLearning | Percentage correct | 79.9 | #129 |
| Image Classification | Fashion-MNIST | VGG8B(2x) + LocalLearning + CO | Percentage error | 4.14 | #6 |
| Image Classification | Kuzushiji-MNIST | VGG8B(2x) + LocalLearning + CO | Accuracy | 99.01 | #3 |
| Image Classification | Kuzushiji-MNIST | VGG8B(2x) + LocalLearning + CO | Error | 0.99 | #3 |
| Image Classification | MNIST | VGG8B + LocalLearning + CO | Percentage error | 0.26 | #12 |
| Image Classification | STL-10 | VGG8B + LocalLearning + CO | Percentage correct | 80.75 | #63 |
| Image Classification | SVHN | VGG8B + LocalLearning + CO | Percentage error | 1.65 | #18 |
