no code implementations • 6 Jul 2023 • Li Jiang, Sijie Chen, JieLin Qiu, Haoran Xu, Wai Kin Chan, Zhao Ding
Because current offline reinforcement learning (RL) research relies heavily on benchmarks, models are often developed without regard for the imbalanced distributions of real-world datasets.