Search Results for author: Jaden B. Travnik

TIDBD: Adapting Temporal-difference Step-sizes Through Stochastic Meta-descent

In this paper, we introduce a method for adapting the step-sizes of temporal difference (TD) learning.

Paper
Add Code

The relationship between a reinforcement learning (RL) agent and an asynchronous environment is often ignored.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.