Search Results for author: Guanming Yao

Found 1 papers, 1 papers with code

UltraFeedback: Boosting Language Models with High-quality Feedback

2 code implementations2 Oct 2023 Ganqu Cui, Lifan Yuan, Ning Ding, Guanming Yao, Wei Zhu, Yuan Ni, Guotong Xie, Zhiyuan Liu, Maosong Sun

However, the scarcity of diverse, naturalistic datasets of human preferences on LLM outputs at scale poses a great challenge to RLHF as well as feedback learning research within the open-source community.

Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.