Search Results for author: Nithyanand Kota

Found 1 paper, 0 papers with code

A K-fold Method for Baseline Estimation in Policy Gradient Algorithms

no code implementations · 3 Jan 2017 · Nithyanand Kota, Abhishek Mishra, Sunil Srinivasa, Xi Chen, Pieter Abbeel

The high variance issue in unbiased policy-gradient methods such as VPG and REINFORCE is typically mitigated by adding a baseline.
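To illustrate why a baseline helps, the sketch below estimates a REINFORCE-style gradient for a two-armed bandit with and without a constant baseline. It is not the K-fold method of the paper. The softmax policy, the reward values, the baseline of 10.5 (the expected reward under the uniform policy), and the helper names `softmax` and `grad_samples` are all assumptions chosen for illustration.

```python
import math
import random
import statistics


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def grad_samples(logits, rewards, baseline, n_samples, seed=0):
    """Per-sample score-function gradient estimates w.r.t. logit 0.

    Each sample is (reward - baseline) * d/d(theta_0) log pi(action).
    """
    rng = random.Random(seed)
    probs = softmax(logits)
    samples = []
    for _ in range(n_samples):
        action = rng.choices(range(len(probs)), weights=probs)[0]
        # Gradient of log-softmax w.r.t. logit 0: indicator(action == 0) - pi_0
        score = (1.0 if action == 0 else 0.0) - probs[0]
        samples.append((rewards[action] - baseline) * score)
    return samples


# Hypothetical toy problem: uniform policy, deterministic rewards per arm.
logits = [0.0, 0.0]
rewards = [10.0, 11.0]

no_baseline = grad_samples(logits, rewards, baseline=0.0, n_samples=10000)
with_baseline = grad_samples(logits, rewards, baseline=10.5, n_samples=10000)

print("no baseline:   mean", statistics.mean(no_baseline),
      "variance", statistics.pvariance(no_baseline))
print("with baseline: mean", statistics.mean(with_baseline),
      "variance", statistics.pvariance(with_baseline))
```

In this toy setting both estimators target the same gradient (-0.25 for logit 0), but subtracting the baseline makes every sample equal to that value, so the variance collapses. In realistic problems the baseline has to be estimated and the reduction is only partial.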

Policy Gradient Methods
