no code implementations • 13 May 2024 • Theodore Jerome Tinker, Kenji Doya, Jun Tani
Two rewards that encourage efficient exploration are the entropy of the action policy and curiosity for information gain.
Efficient Exploration • Navigate
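
To make the two reward terms in the abstract concrete, here is a minimal sketch in Python. It is not the authors' implementation: it assumes a discrete action policy, and it assumes information gain is measured as the KL divergence between a one-dimensional Gaussian belief after an observation and the belief before it, which is one common way to formalize curiosity. The function names and the weights `alpha` and `eta` are hypothetical, chosen only for illustration.

```python
import math


def policy_entropy(probs):
    """Shannon entropy (in nats) of a discrete action distribution."""
    # Zero-probability actions contribute nothing to the sum.
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def gaussian_kl(mu_post, sigma_post, mu_prior, sigma_prior):
    """KL(posterior || prior) for two 1-D Gaussians, in nats.

    Used here as a stand-in for information gain (an assumption):
    how far the belief moved after seeing a new observation.
    """
    return (
        math.log(sigma_prior / sigma_post)
        + (sigma_post ** 2 + (mu_post - mu_prior) ** 2) / (2.0 * sigma_prior ** 2)
        - 0.5
    )


def intrinsic_reward(probs, mu_post, sigma_post, mu_prior, sigma_prior,
                     alpha=0.1, eta=1.0):
    """Weighted sum of the entropy bonus and the curiosity bonus.

    alpha and eta are hypothetical weights, not values from the paper.
    """
    entropy_bonus = alpha * policy_entropy(probs)
    curiosity_bonus = eta * gaussian_kl(mu_post, sigma_post, mu_prior, sigma_prior)
    return entropy_bonus + curiosity_bonus


if __name__ == "__main__":
    uniform = [0.25, 0.25, 0.25, 0.25]
    print(policy_entropy(uniform))
    print(gaussian_kl(1.0, 1.0, 0.0, 1.0))
    print(intrinsic_reward(uniform, 1.0, 1.0, 0.0, 1.0))
```

Under these assumptions, a uniform policy receives the largest entropy bonus and a deterministic policy receives none, while the curiosity bonus is zero whenever an observation leaves the belief unchanged. In practice both terms would be added to the extrinsic task reward.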