no code implementations • 12 Jun 2020 • Kwangyeon Kim, Akshita Gupta, Hong-Cheol Choi, Inseok Hwang
The proposed algorithm is developed for the discrete state and action space and utilizes a multi-class support vector machine (SVM) to represent the policy.