Search Results for author: Donghai Hong

Found 1 paper, 0 papers with code

Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction

no code implementations • 4 Feb 2024 • Jiaming Ji, Boyuan Chen, Hantao Lou, Donghai Hong, Borong Zhang, Xuehai Pan, Juntao Dai, Yaodong Yang

Here we introduce Aligner, a new efficient alignment paradigm that bypasses the whole RLHF process by learning the correctional residuals between the aligned and the unaligned answers.
