Search Results for author: Hado V. Hasselt

Double Q-learning

We apply the double estimator to Q-learning to construct Double Q-learning, a new off-policy reinforcement learning algorithm.
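
As a rough illustration of what applying a double estimator to Q-learning can look like, here is a minimal tabular sketch. It assumes two value tables, one of which picks the greedy next action while the other supplies that action's value. The function name `double_q_update`, the `update_a` flag, and the default `alpha` and `gamma` values are hypothetical choices made for this example and do not come from the listing.

```python
from typing import List

Table = List[List[float]]  # table[state][action] -> estimated action value


def double_q_update(q_a: Table, q_b: Table, s: int, a: int, r: float,
                    s_next: int, update_a: bool,
                    alpha: float = 0.1, gamma: float = 0.99,
                    terminal: bool = False) -> None:
    """Apply one tabular Double Q-learning update in place (illustrative sketch)."""
    # The table being updated selects the action; the other table evaluates it.
    selector, evaluator = (q_a, q_b) if update_a else (q_b, q_a)
    if terminal:
        # No bootstrapping from a terminal transition.
        target = r
    else:
        row = selector[s_next]
        # Greedy next action according to the table being updated.
        best = max(range(len(row)), key=row.__getitem__)
        # Value of that action according to the other table.
        target = r + gamma * evaluator[s_next][best]
    selector[s][a] += alpha * (target - selector[s][a])
```

In a full agent, `update_a` would typically be drawn at random with probability 0.5 on each step, so that both tables are trained on different subsets of the experience.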

