Search Results for author: Jiangcun Du

Found 1 papers, 0 papers with code

A Comprehensive Evaluation of Quantization Strategies for Large Language Models

no code implementations26 Feb 2024 Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong

Our experimental results indicate that LLMs with 4-bit quantization can retain performance comparable to their non-quantized counterparts, and perplexity can serve as a proxy metric for quantized LLMs on most benchmarks.

Language Modelling Quantization

Cannot find the paper you are looking for? You can Submit a new open access paper.