no code implementations • 23 May 2024 • Luise Ge, Daniel Halpern, Evi Micha, Ariel D. Procaccia, Itai Shapira, Yevgeniy Vorobeychik, Junlin Wu
The problem of learning a reward function is one of preference aggregation that, we argue, largely falls within the scope of social choice theory.
no code implementations • 4 May 2024 • Luise Ge, Brendan Juba, Yevgeniy Vorobeychik
Our results thus exhibit a qualitative learnability gap between passive and active learning from pairwise preference queries, demonstrating the value of the ability to select pairwise queries for utility learning.