1 code implementation • 5 Jul 2023 • Ben Norman, Jeff Clune
We argue a core barrier prohibiting many RL approaches from learning intelligent exploration is that the methods attempt to explore and exploit simultaneously, which harms both exploration and exploitation as the goals often conflict.