no code implementations • 5 Dec 2023 • Hengrui Zhang, August Ning, Rohan Prabhakar, David Wentzlaff
With the large hardware needed to simply run LLM inference, evaluating different hardware designs becomes a new bottleneck.
Language Modelling Large Language Model +1