Search Results for author: Hado V. Hasselt

Found 1 papers, 0 papers with code

Double Q-learning

no code implementations NeurIPS 2010 Hado V. Hasselt

We apply the double estimator to Q-learning to construct Double Q-learning, a new off-policy reinforcement learning algorithm.

Q-Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.