no code implementations • 30 Apr 2021 • Ivan Lazarevich, Alexander Kozlov, Nikita Malinin
We present a post-training weight pruning method for deep neural networks that achieves accuracy levels tolerable for the production setting and that is sufficiently fast to be run on commodity hardware such as desktop CPUs or edge devices.