Search Results for author: Xuhao Chen

Found 2 papers, 1 paper with code

Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs

1 code implementation • 28 Feb 2018 • Xuhao Chen

Weight pruning can compress DNN models by removing redundant parameters from the networks, but it introduces sparsity into the weight matrices, which makes the computation inefficient on GPUs.
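To make the sparsity concrete, here is a minimal sketch in plain Python. It is not taken from the Escoin paper: the magnitude threshold, the CSR storage format, and the helper names `prune_by_magnitude`, `to_csr`, and `csr_matvec` are all assumptions chosen for illustration. It prunes small weights, stores the survivors in compressed form, and shows the indirect indexing that a sparse product then needs.

```python
# Illustrative sketch only: magnitude pruning and CSR storage are assumptions
# chosen for this example, not details taken from the Escoin paper.

def prune_by_magnitude(weights, threshold):
    """Zero out every weight whose absolute value is below the threshold."""
    return [[w if abs(w) >= threshold else 0.0 for w in row] for row in weights]

def to_csr(dense):
    """Convert a dense matrix to CSR: (values, column indices, row pointers)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for j, w in enumerate(row):
            if w != 0.0:
                values.append(w)
                col_idx.append(j)
        row_ptr.append(len(values))  # end of this row's slice in values
    return values, col_idx, row_ptr

def csr_matvec(values, col_idx, row_ptr, x):
    """Multiply a CSR matrix by a dense vector x."""
    y = []
    for i in range(len(row_ptr) - 1):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # x is read through col_idx, so the access pattern depends on the data
            acc += values[k] * x[col_idx[k]]
        y.append(acc)
    return y

# Hypothetical 3x4 weight matrix, used only to exercise the helpers.
weights = [
    [0.9, -0.02, 0.01, 0.5],
    [0.03, 0.0, -0.7, 0.04],
    [-0.01, 0.6, 0.02, -0.8],
]
pruned = prune_by_magnitude(weights, threshold=0.1)
values, col_idx, row_ptr = to_csr(pruned)
sparsity = 1 - len(values) / (len(weights) * len(weights[0]))
y = csr_matvec(values, col_idx, row_ptr, [1.0, 2.0, 3.0, 4.0])
```

In this toy case 7 of the 12 weights are removed, and each row keeps a different number of nonzeros. Uneven per-row work and reads gathered through `col_idx` are commonly cited reasons why sparse kernels underuse GPU hardware, though the abstract above does not say which factors it has in mind.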
