1 code implementation • 9 Oct 2023 • Zonghai Yao, Benjamin J Schloss, Sai P. Selvaraj
Existing works use human feedback to train large language models (LLMs) in general domain abstractive summarization and have obtained summary quality exceeding traditional likelihood training.