Generalizing Pooling Functions in Convolutional Neural Networks: Mixed, Gated, and Tree

30 Sep 2015 · Chen-Yu Lee, Patrick W. Gallagher, Zhuowen Tu

We seek to improve deep neural networks by generalizing the pooling operations that play a central role in current architectures. We pursue a careful exploration of approaches that allow pooling to learn and to adapt to complex and variable patterns. The two primary directions are (1) learning a pooling function via two strategies for combining max and average pooling, and (2) learning a pooling function in the form of a tree-structured fusion of pooling filters that are themselves learned. In our experiments, every generalized pooling operation we explore improves performance when used in place of average or max pooling. We experimentally demonstrate that the proposed pooling operations provide a boost in invariance properties relative to conventional pooling and set the state of the art on several widely adopted benchmark datasets; they are also easy to implement and can be applied within various deep neural network architectures. These benefits come with only a light increase in computational overhead during training and a very modest increase in the number of model parameters.
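
As a concrete illustration of direction (1), the sketch below shows one way the "mixed" (single learned mixing weight) and "gated" (per-region, input-dependent mixing weight) max-average combinations could be implemented in PyTorch. The module names, the sigmoid parameterization, and the depthwise-convolution gate are our assumptions for illustration, not the authors' reference code.

```python
# Minimal sketch of mixed and gated max-average pooling (not the paper's reference code).
import torch
import torch.nn as nn
import torch.nn.functional as F

class MixedPool2d(nn.Module):
    """Learned convex combination a*max + (1-a)*avg with a single scalar a."""
    def __init__(self, kernel_size=2, stride=2):
        super().__init__()
        self.kernel_size, self.stride = kernel_size, stride
        self.mix_logit = nn.Parameter(torch.zeros(1))  # sigmoid maps this to a in (0, 1)

    def forward(self, x):
        a = torch.sigmoid(self.mix_logit)
        mx = F.max_pool2d(x, self.kernel_size, self.stride)
        av = F.avg_pool2d(x, self.kernel_size, self.stride)
        return a * mx + (1 - a) * av

class GatedPool2d(nn.Module):
    """Responsive variant: the mixing weight is computed per pooling region
    from the region's contents via a learned gate (here a depthwise conv,
    an assumption made for this sketch)."""
    def __init__(self, channels, kernel_size=2, stride=2):
        super().__init__()
        self.kernel_size, self.stride = kernel_size, stride
        # One gate value per pooling region and channel.
        self.gate = nn.Conv2d(channels, channels, kernel_size, stride, groups=channels)

    def forward(self, x):
        a = torch.sigmoid(self.gate(x))
        mx = F.max_pool2d(x, self.kernel_size, self.stride)
        av = F.avg_pool2d(x, self.kernel_size, self.stride)
        return a * mx + (1 - a) * av
```

Either module can be dropped in wherever a standard pooling layer would appear, e.g. `GatedPool2d(channels=64)` in place of `nn.MaxPool2d(2)`; the tree-structured pooling variant additionally learns the pooling filters being fused and is not shown here.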


Results from the Paper


| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|------|---------|-------|-------------|--------------|-------------|
| Image Classification | CIFAR-10 | Tree+Max-Avg pooling | Percentage correct | 94.0 | #149 |
| Image Classification | CIFAR-100 | Tree+Max-Avg pooling | Percentage correct | 67.6 | #173 |
| Image Classification | MNIST | Tree+Max-Avg pooling | Percentage error | 0.3 | #17 |
| Image Classification | SVHN | Tree+Max-Avg pooling | Percentage error | 1.7 | #19 |
