1 code implementation • 7 Nov 2023 • Geyang Guo, Ranchi Zhao, Tianyi Tang, Wayne Xin Zhao, Ji-Rong Wen
Alignment with human preference is a desired property of large language models (LLMs).