no code implementations • 4 Jun 2022 • Xue-Kun Jin, Xu-Hui Liu, Shengyi Jiang, Yang Yu
Value function estimation is an indispensable subroutine in reinforcement learning, which becomes more challenging in the offline setting.
Off-policy evaluation reinforcement-learning