Search Results for author: Pranav Pulijala

Found 1 papers, 1 papers with code

LookupFFN: Making Transformers Compute-lite for CPU inference

1 code implementation12 Mar 2024 Zhanpeng Zeng, Michael Davies, Pranav Pulijala, Karthikeyan Sankaralingam, Vikas Singh

While GPU clusters are the de facto choice for training large deep neural network (DNN) models today, several reasons including ease of workflow, security and cost have led to efforts investigating whether CPUs may be viable for inference in routine use in many sectors of the industry.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.