Proximal Policy Optimization via Enhanced Exploration Efficiency

11 Nov 2020 · Junwei Zhang, Zhenghao Zhang, Shuai Han, Shuai Lü

The proximal policy optimization (PPO) algorithm is a deep reinforcement learning algorithm with outstanding performance, especially in continuous control tasks. However, the performance of this method is still affected by its exploration ability...

Methods used in the Paper

Entropy Regularization
Policy Gradient Methods