1 code implementation • 15 Apr 2024 • Siyan Zhao, Daniel Israel, Guy Van Den Broeck, Aditya Grover
In this work, we highlight the following pitfall of prefilling: for batches containing high-varying prompt lengths, significant computation is wasted by the standard practice of padding sequences to the maximum length.
no code implementations • 17 Oct 2023 • Siyan Zhao, John Dang, Aditya Grover
We introduce Group Preference Optimization (GPO), an alignment framework that steers language models to preferences of individual groups in a few-shot manner.