no code implementations • NeurIPS 2018 • Mahdi Imani, Seyede Fatemeh Ghoreishi, Ulisses M. Braga-Neto
In the online stage, the action with the maximum expected return with respect to the posterior distribution of the parameters is selected.
Decision Making Gaussian Processes