no code implementations • 26 Oct 2021 • Arsenii Kuznetsov, Alexander Grishin, Artem Tsypin, Arsenii Ashukha, Artur Kadurin, Dmitry Vetrov
Overestimation bias control techniques are used by the majority of high-performing off-policy reinforcement learning algorithms.
10 code implementations • ICML 2020 • Arsenii Kuznetsov, Pavel Shvechikov, Alexander Grishin, Dmitry Vetrov
The overestimation bias is one of the major impediments to accurate off-policy learning.