Search Results for author: Vikramjeet Das

Found 2 papers, 0 papers with code

Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

no code implementations • 1 Dec 2023 • Viraj Mehta, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Jeff Schneider, Willie Neiswanger

Preference-based feedback is important for many applications in reinforcement learning where direct evaluation of a reward function is not feasible.

reinforcement-learning

Kernelized Offline Contextual Dueling Bandits

no code implementations • 21 Jul 2023 • Viraj Mehta, Ojash Neopane, Vikramjeet Das, Sen Lin, Jeff Schneider, Willie Neiswanger

Preference-based feedback is important for many applications where direct evaluation of a reward function is not feasible.
