1 code implementation • 14 Mar 2024 • Muhammad Adnan, Akhil Arunkumar, Gaurav Jain, Prashant J. Nair, Ilya Soloveychik, Purushotham Kamath
This approach effectively reduces both the KV cache size and memory bandwidth usage without compromising model accuracy.
no code implementations • 18 Mar 2018 • Purushotham Kamath, Abhishek Singh, Debo Dutta
Fast Neural Architecture Construction (NAC) is a method to construct deep network architectures by pruning and expansion of a base network.
no code implementations • ICML 2018 AutoML Workshop 2018 • Purushotham Kamath, Abhishek Singh, Debo Dutta
Its key architectural features are the decoupling of the network generation from the network evaluation, support for network instrumentation, open model specification and a microservices based architecture for deployment at scale.