no code implementations • 7 Sep 2023 • Chengmin Zhou, Xin Lu, Jiapeng Dai, Bingding Huang, Xiaoxu Liu, Pasi Fränti
Reinforcement learning algorithms generate optimal or near-optimal time-sequential predictions.
no code implementations • 16 Jul 2023 • Chengmin Zhou, Chao Wang, Haseeb Hassan, Himat Shah, Bingding Huang, Pasi Fränti
Fifth, we systematically present the hybridization of Bayesian inference and RL which is a promising direction to improve the convergence of RL for better motion planning.
no code implementations • 5 Feb 2021 • Chengmin Zhou, Bingding Huang, Pasi Fränti
Intelligent robots provide a new insight into efficiency improvement in industrial and service scenarios to replace human labor.
no code implementations • 4 Feb 2021 • Chengmin Zhou, Bingding Huang, Pasi Fränti
These include traditional planning algorithms, supervised learning, optimal value reinforcement learning, policy gradient reinforcement learning.