no code implementations • 12 Apr 2024 • Je-Yong Lee, DongHyun Lee, Genghan Zhang, Mo Tiwari, Azalia Mirhoseini
We demonstrate that CATS can be applied to various base models, including Mistral-7B and Llama2-7B, and outperforms existing sparsification techniques in downstream task performance.
1 code implementation • 3 Apr 2024 • Zhongming Yu, Genghan Zhang, Hanxian Huang, Xin Chen, Jishen Zhao
Yet, efficient tensor-centric frameworks for GNNs remain scarce due to unique challenges and limitations encountered when implementing segment reduction in GNN contexts.
1 code implementation • 16 Apr 2023 • Chenggang Zhao, Genghan Zhang, Mingyu Gao
KAS reviews NAS from a system perspective and zooms into a more fine-grained level to generate neural kernels with both high performance and good accuracy.