no code implementations • 2 Jan 2024 • Jie Feng, Ke Wei, Jinchi Chen
Natural policy gradient (NPG) and its variants are widely-used policy search methods in reinforcement learning.
no code implementations • 31 May 2023 • Jiacai Liu, Jinchi Chen, Ke Wei
To show the local linear convergence of the algorithm, we have indeed established the contraction of the sub-optimal probability $b_s^k$ (i. e., the probability of the output policy $\pi^k$ on non-optimal actions) when $k\ge k_0$.
no code implementations • 29 Aug 2019 • Jinchi Chen, Xiaxia Wang, Gong Cheng, Evgeny Kharlamov, Yuzhong Qu
Reusing published datasets on the Web is of great interest to researchers and developers.
no code implementations • 2 Jul 2019 • Xiaxia Wang, Jinchi Chen, Shuxin Li, Gong Cheng, Jeff Z. Pan, Evgeny Kharlamov, Yuzhong Qu
Reusing existing datasets is of considerable significance to researchers and developers.