Search Results for author: Nikolaus H. R. Howe

Found 2 papers, 1 papers with code

Defining and Characterizing Reward Hacking

no code implementations • 27 Sep 2022 • Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger

We provide the first formal definition of reward hacking, a phenomenon where optimizing an imperfect proxy reward function, $\mathcal{\tilde{R}}$, leads to poor performance according to the true reward function, $\mathcal{R}$.

Paper
Add Code

Myriad: a real-world testbed to bridge trajectory optimization and deep learning

1 code implementation • 22 Feb 2022 • Nikolaus H. R. Howe, Simon Dufort-Labbé, Nitarshan Rajkumar, Pierre-Luc Bacon

We present Myriad, a testbed written in JAX for learning and planning in real-world continuous environments.

BIG-bench Machine Learning

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.