1 code implementation • 12 Apr 2024 • Hui Bai, Ran Cheng
Hyperparameter optimization plays a key role in the machine learning domain.
no code implementations • 7 Mar 2023 • Hui Bai, Ran Cheng, Yaochu Jin
This article presents a comprehensive survey of state-of-the-art methods for integrating evolutionary computation (EC) into reinforcement learning (RL), referred to as evolutionary reinforcement learning (EvoRL).
no code implementations • 21 Sep 2022 • Hui Bai, Ruimin Shen, Yue Lin, Botian Xu, Ran Cheng
In comparison with the state-of-the-art RLlib, we empirically demonstrate the unique advantages of Lamarckian on benchmark tests with up to 6000 CPU cores: i) both the sampling efficiency and the training speed are doubled when running PPO on the Google football game; ii) training is 13 times faster when running PBT+PPO on the Pong game.