Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework

11 Jun 2020 Chuheng Zhang Yuanying Cai Longbo Huang Jian Li

Exploration is essential for reinforcement learning (RL). To face the challenges of exploration, we consider a reward-free RL framework that completely separates exploration from exploitation and brings new challenges for exploration algorithms... (read more)

PDF Abstract
No code implementations yet. Submit your code now

Datasets


Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper


METHOD TYPE
Entropy Regularization
Regularization
PPO
Policy Gradient Methods
Q-Learning
Off-Policy TD Control