Search Results for author: Genghan Zhang

Found 3 papers, 2 papers with code

CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models

no code implementations • 12 Apr 2024 • Je-Yong Lee, DongHyun Lee, Genghan Zhang, Mo Tiwari, Azalia Mirhoseini

We demonstrate that CATS can be applied to various base models, including Mistral-7B and Llama2-7B, and outperforms existing sparsification techniques in downstream task performance.

Paper
Add Code

GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU

1 code implementation • 3 Apr 2024 • Zhongming Yu, Genghan Zhang, Hanxian Huang, Xin Chen, Jishen Zhao

Yet, efficient tensor-centric frameworks for GNNs remain scarce due to unique challenges and limitations encountered when implementing segment reduction in GNN contexts.

Paper
Code

Canvas: End-to-End Kernel Architecture Search in Neural Networks

1 code implementation • 16 Apr 2023 • Chenggang Zhao, Genghan Zhang, Mingyu Gao

KAS reviews NAS from a system perspective and zooms into a more fine-grained level to generate neural kernels with both high performance and good accuracy.

Neural Architecture Search

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.