no code implementations • 5 Jan 2024 • Adnan Hoque, Less Wright, Chih-Chieh Yang, Mudhakar Srivatsa, Raghu Ganti
Our implementation shows improvement for the type of skinny matrix-matrix multiplications found in foundation model inference workloads.
no code implementations • 21 Apr 2023 • Yanli Zhao, Andrew Gu, Rohan Varma, Liang Luo, Chien-chin Huang, Min Xu, Less Wright, Hamid Shojanazeri, Myle Ott, Sam Shleifer, Alban Desmaison, Can Balioglu, Pritam Damania, Bernard Nguyen, Geeta Chauhan, Yuchen Hao, Ajit Mathews, Shen Li
It is widely acknowledged that large models have the potential to deliver superior performance across a broad range of domains.
2 code implementations • 25 Jun 2021 • Less Wright, Nestor Demeure
As optimizers are critical to the performance of neural networks, a large number of papers innovating on the subject are published every year.