Search Results for author: Eric Graves

Found 4 papers, 1 papers with code

Value-aware Importance Weighting for Off-policy Reinforcement Learning

no code implementations • 27 Jun 2023 • Kristopher De Asis, Eric Graves, Richard S. Sutton

Importance sampling is a central idea underlying off-policy prediction in reinforcement learning.

Paper
Add Code

Importance Sampling Placement in Off-Policy Temporal-Difference Methods

no code implementations • 18 Mar 2022 • Eric Graves, Sina Ghiassian

A central challenge to applying many off-policy reinforcement learning algorithms to real world problems is the variance introduced by importance sampling.

Paper
Add Code

Off-Policy Actor-Critic with Emphatic Weightings

1 code implementation • 16 Nov 2021 • Eric Graves, Ehsan Imani, Raksha Kumaraswamy, Martha White

A variety of theoretically-sound policy gradient algorithms exist for the on-policy setting due to the policy gradient theorem, which provides a simplified form for the gradient.

Paper
Code

An Off-policy Policy Gradient Theorem Using Emphatic Weightings

no code implementations • NeurIPS 2018 • Ehsan Imani, Eric Graves, Martha White

There have been a host of theoretically sound algorithms proposed for the on-policy setting, due to the existence of the policy gradient theorem which provides a simplified form for the gradient.

Policy Gradient Methods

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.