no code implementations • 17 Mar 2023 • Xiuding Cai, Jiao Chen, Yaoyao Zhu, Beimin Wang, Yu Yao
In this paper, Policy Constraint Q-Learning (PCQL), a data-driven reinforcement learning algorithm for solving the problem of learning anesthesia strategies on real clinical datasets, is proposed.