Search Results for author: Joshua Zhanson

Found 1 paper, 0 papers with code

On Proximal Policy Optimization's Heavy-tailed Gradients

no code implementations • 20 Feb 2021 • Saurabh Garg, Joshua Zhanson, Emilio Parisotto, Adarsh Prasad, J. Zico Kolter, Zachary C. Lipton, Sivaraman Balakrishnan, Ruslan Salakhutdinov, Pradeep Ravikumar

In this paper, we present a detailed empirical study to characterize the heavy-tailed nature of the gradients of the PPO surrogate reward function.

Continuous Control
