no code implementations • 9 Oct 2022 • Jiafei Lyu, Aicheng Gong, Le Wan, Zongqing Lu, Xiu Li
We present state advantage weighting for offline reinforcement learning (RL).
D4RL Offline RL +2