Asynchronous Advantage Actor Critic: Non-asymptotic Analysis and Linear Speedup

31 Dec 2020 Han Shen Kaiqing Zhang Mingyi Hong Tianyi Chen

Asynchronous and parallel implementation of standard reinforcement learning (RL) algorithms is a key enabler of the tremendous success of modern RL. Among many asynchronous RL algorithms, arguably the most popular and effective one is the asynchronous advantage actor-critic (A3C) algorithm... (read more)

PDF Abstract
No code implementations yet. Submit your code now

Datasets


Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper


METHOD TYPE
Softmax
Output Functions
Entropy Regularization
Regularization
Convolution
Convolutions
Dense Connections
Feedforward Networks
A3C
Policy Gradient Methods