1 code implementation • 11 Jun 2019 • Yunlong Liu, Jianyang Zheng
How an agent can act optimally in stochastic, partially observable domains is a challenge problem, the standard approach to address this issue is to learn the domain model firstly and then based on the learned model to find the (near) optimal policy.
no code implementations • 5 Apr 2019 • Yunlong Liu, Jianyang Zheng
Planning in stochastic and partially observable environments is a central issue in artificial intelligence.