no code implementations • 20 May 2021 • YunLong Li, Yiming Peng, Dengzheng Zhang, Yingan Mai, Zhengrong Ruan
The energy consumption of the HVAC system accounts for a significant portion of the energy consumption of the public building system, and using an efficient energy consumption prediction model can assist it in carrying out effective energy-saving transformation.
no code implementations • 31 Jan 2021 • Yiming Peng, Hisao Ishibuchi
Since equivalent solutions are overlapping (i. e., occupying the same position) in the objective space, standard diversity estimators such as crowding distance are likely to select one of them and discard the others, which may cause diversity loss in the decision space.
no code implementations • 21 Apr 2020 • Yiming Peng, Hisao Ishibuchi
With a clearing mechanism and a greedy removal strategy, our proposed algorithm can effectively preserve equivalent Pareto optimal solutions (i. e., different Pareto optimal solutions with same objective values).
no code implementations • 14 Feb 2019 • Gang Chen, Yiming Peng
We propose a new policy iteration theory as an important extension of soft policy iteration and Soft Actor-Critic (SAC), one of the most efficient model free algorithms for deep reinforcement learning.
no code implementations • 2 Sep 2018 • Gang Chen, Yiming Peng, Mengjie Zhang
With the aim of improving sample efficiency and learning performance, we will develop a new DRL algorithm in this paper that seamless integrates entropy-induced and bootstrap-induced techniques for efficient and deep exploration of the learning environment.
no code implementations • 17 Apr 2018 • Gang Chen, Yiming Peng, Mengjie Zhang
While PPO is inspired by the same learning theory that justifies trust region policy optimization (TRPO), PPO substantially simplifies algorithm design and improves data efficiency by performing multiple epochs of \emph{clipped policy optimization} from sampled data.