12 Mar 2024 • Pulkit Pattnaik, Rishabh Maheshwary, Kelechi Ogueji, Vikas Yadav, Sathwik Tejaswi Madhusudhan
Given the availability of such quality ratings for multiple responses, we propose using these responses to create multiple preference pairs for a given prompt.
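As a rough illustration of the idea, the sketch below turns a set of rated responses for one prompt into several preference pairs. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `build_preference_pairs`, the `(response, rating)` input shape, the `prompt`/`chosen`/`rejected` field names, the convention that a higher rating is better, and the choice to skip tied ratings are all hypothetical.

```python
from itertools import combinations


def build_preference_pairs(prompt, rated_responses):
    """Build preference pairs from rated responses to a single prompt.

    rated_responses: list of (response_text, rating) tuples.
    Assumption: a higher rating means a better response.
    """
    pairs = []
    # Consider every unordered pair of responses exactly once.
    for (resp_a, score_a), (resp_b, score_b) in combinations(rated_responses, 2):
        if score_a == score_b:
            # Assumption: equal ratings express no preference, so skip them.
            continue
        if score_a > score_b:
            chosen, rejected = resp_a, resp_b
        else:
            chosen, rejected = resp_b, resp_a
        pairs.append({"prompt": prompt, "chosen": chosen, "rejected": rejected})
    return pairs
```

With n distinctly rated responses, this yields up to n(n-1)/2 pairs per prompt instead of a single chosen/rejected pair.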