1 code implementation • 21 Dec 2020 • Brendon Matusch, Jimmy Ba, Danijar Hafner
Moreover, input entropy and information gain correlate more strongly with human similarity than task reward does, suggesting the use of intrinsic objectives for designing agents that behave similarly to human players.