Search Results for author: Yiming Peng

Found 6 papers, 0 papers with code

XGBoost energy consumption prediction based on multi-system data HVAC

no code implementations20 May 2021 YunLong Li, Yiming Peng, Dengzheng Zhang, Yingan Mai, Zhengrong Ruan

The energy consumption of the HVAC system accounts for a significant portion of the energy consumption of the public building system, and using an efficient energy consumption prediction model can assist it in carrying out effective energy-saving transformation.

Niching Diversity Estimation for Multi-modal Multi-objective Optimization

no code implementations31 Jan 2021 Yiming Peng, Hisao Ishibuchi

Since equivalent solutions are overlapping (i. e., occupying the same position) in the objective space, standard diversity estimators such as crowding distance are likely to select one of them and discard the others, which may cause diversity loss in the decision space.

A Decomposition-based Large-scale Multi-modal Multi-objective Optimization Algorithm

no code implementations21 Apr 2020 Yiming Peng, Hisao Ishibuchi

With a clearing mechanism and a greedy removal strategy, our proposed algorithm can effectively preserve equivalent Pareto optimal solutions (i. e., different Pareto optimal solutions with same objective values).

Off-Policy Actor-Critic in an Ensemble: Achieving Maximum General Entropy and Effective Environment Exploration in Deep Reinforcement Learning

no code implementations14 Feb 2019 Gang Chen, Yiming Peng

We propose a new policy iteration theory as an important extension of soft policy iteration and Soft Actor-Critic (SAC), one of the most efficient model free algorithms for deep reinforcement learning.

Effective Exploration for Deep Reinforcement Learning via Bootstrapped Q-Ensembles under Tsallis Entropy Regularization

no code implementations2 Sep 2018 Gang Chen, Yiming Peng, Mengjie Zhang

With the aim of improving sample efficiency and learning performance, we will develop a new DRL algorithm in this paper that seamless integrates entropy-induced and bootstrap-induced techniques for efficient and deep exploration of the learning environment.

Reinforcement Learning (RL)

An Adaptive Clipping Approach for Proximal Policy Optimization

no code implementations17 Apr 2018 Gang Chen, Yiming Peng, Mengjie Zhang

While PPO is inspired by the same learning theory that justifies trust region policy optimization (TRPO), PPO substantially simplifies algorithm design and improves data efficiency by performing multiple epochs of \emph{clipped policy optimization} from sampled data.

Learning Theory

Cannot find the paper you are looking for? You can Submit a new open access paper.