11 Jan 2019 • Thiam Khean Hah, Yeong Tat Liew, Jason Ong
The projected performance of a multichip persistent implementation of all ResNet-50 convolution layers is 10k images/s/chip at batch size 2.