Regularly Updated Deterministic Policy Gradient Algorithm

1 Jul 2020 Shuai Han Wenbo Zhou Shuai Lü Jiayu Yu

Deep Deterministic Policy Gradient (DDPG) algorithm is one of the most well-known reinforcement learning methods. However, this method is inefficient and unstable in practical applications... (read more)

PDF Abstract
No code implementations yet. Submit your code now

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper


METHOD TYPE
Convolution
Convolutions
Double Q-learning
Off-Policy TD Control
Experience Replay
Replay Memory
Weight Decay
Regularization
ReLU
Activation Functions
Adam
Stochastic Optimization
Batch Normalization
Normalization
Dense Connections
Feedforward Networks
DDPG
Policy Gradient Methods
Q-Learning
Off-Policy TD Control
Clipped Double Q-learning
Off-Policy TD Control