Search Results for author: Kezhao Huang

PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR

To accelerate DNN computation, tensor compilers are proposed to generate efficient code on different domain-specific accelerators.

Paper
Add Code

A key performance bottleneck when training graph neural network (GNN) models on large, real-world graphs is loading node features onto a GPU.

Paper
Add Code

Boosting the runtime performance of deep neural networks (DNNs) is critical due to their wide adoption in real-world tasks.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.