Search Results for author: SangLyul Cho

Found 1 papers, 1 papers with code

Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs

1 code implementation16 Feb 2024 Yeonhong Park, Jake Hyun, SangLyul Cho, Bonggeun Sim, Jae W. Lee

Recently, considerable efforts have been directed towards compressing Large Language Models (LLMs), which showcase groundbreaking capabilities across diverse applications but entail significant deployment costs due to their large sizes.

Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.