Diversity Actor-Critic: Sample-Aware Entropy Regularization for Sample-Efficient Exploration

2 Jun 2020 Seungyul Han Youngchul Sung

Policy entropy regularization is commonly used for better exploration in deep reinforcement learning (RL). However, policy entropy regularization is sample-inefficient in off-policy learning since it does not take the distribution of previous samples stored in the replay buffer into account... (read more)

PDF Abstract
No code implementations yet. Submit your code now

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper


METHOD TYPE
DAC
Hyperparameter Search
Entropy Regularization
Regularization