Search Results for author: Jiacai Liu

Found 2 papers, 0 papers with code

Elementary Analysis of Policy Gradient Methods

no code implementations • 4 Apr 2024 • Jiacai Liu, Wenye Li, Ke Wei

Projected policy gradient under the simplex parameterization, and policy gradient and natural policy gradient under the softmax parameterization, are fundamental algorithms in reinforcement learning.

Policy Gradient Methods


On the Linear Convergence of Policy Gradient under Hadamard Parameterization

no code implementations • 31 May 2023 • Jiacai Liu, Jinchi Chen, Ke Wei

To show the local linear convergence of the algorithm, we have indeed established the contraction of the sub-optimal probability $b_s^k$ (i.e., the probability of the output policy $\pi^k$ on non-optimal actions) when $k\ge k_0$.

