no code implementations • 2 Jan 2024 • Zhaoan Wang, Shaoping Xiao, Junchao Li, Jun Wang
However, our study highlights the need to retrain agents so that they acquire new optimal policies under extreme weather events.
1 code implementation • 30 Apr 2023 • Junchao Li, Mingyu Cai, Zhen Kan, Shaoping Xiao
We formulate motion planning as a probabilistic-labeled partially observable Markov decision process (PL-POMDP) problem and use linear temporal logic (LTL) to express the complex task.
1 code implementation • 27 Dec 2021 • Yue Zhu, Mingyu Cai, Chris Schwarz, Junchao Li, Shaoping Xiao
First, the optimal policy obtained from PPO is compared with those from DQN and DDQN.