no code implementations • 1 Dec 2023 • Viraj Mehta, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Jeff Schneider, Willie Neiswanger
Preference-based feedback is important for many applications in reinforcement learning where direct evaluation of a reward function is not feasible.
no code implementations • 21 Jul 2023 • Viraj Mehta, Ojash Neopane, Vikramjeet Das, Sen Lin, Jeff Schneider, Willie Neiswanger
Preference-based feedback is important for many applications where direct evaluation of a reward function is not feasible.