no code implementations • 2 Dec 2022 • Zhiying Xu, Hongding Peng, Wei Wang
Traditional deep learning compilers rely on heuristics for subgraph generation, which impose extra constraints on graph optimization, e.g., each subgraph can only contain at most one complex operator.
no code implementations • 22 Oct 2022 • Zhiying Xu, Jiafan Xu, Hongding Peng, Wei Wang, Xiaoliang Wang, Haoran Wan, Haipeng Dai, Yixu Xu, Hao Cheng, Kun Wang, Guihai Chen
Deep learning models rely on highly optimized tensor libraries for efficient inference on heterogeneous hardware.