Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation...
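
To make "directly searching over the parameter space of controllers" without exact gradients concrete, here is a minimal sketch, not the paper's algorithm. It assumes a hypothetical scalar linear system with quadratic cost, a linear feedback law `u = -k * x`, and a two-point random-direction estimate of the cost gradient. All names (`rollout_cost`, `zeroth_order_search`) and all numeric values (`a`, `b`, `q`, `r`, step sizes, horizon) are illustrative assumptions. The search routine only ever queries the rollout cost, treating the dynamics as a black box.

```python
import random


def rollout_cost(k, a=0.9, b=1.0, q=1.0, r=1.0, x0=1.0, horizon=50):
    """Simulate x_{t+1} = a*x_t + b*u_t with u_t = -k*x_t and return the
    accumulated quadratic cost. The system parameters are assumed values;
    the search below never reads them, it only observes the returned cost."""
    x, cost = x0, 0.0
    for _ in range(horizon):
        u = -k * x                      # linear state-feedback controller
        cost += q * x * x + r * u * u   # stage cost
        x = a * x + b * u               # system transition
    return cost


def zeroth_order_search(k0=0.0, step=0.01, delta=0.01, iters=200, seed=0):
    """Gradient-free search over the controller gain k using a two-point
    estimate along a random direction (a sign, since k is scalar)."""
    rng = random.Random(seed)
    k = k0
    for _ in range(iters):
        d = rng.choice((-1.0, 1.0))     # random search direction
        # Two cost queries stand in for the unavailable exact gradient.
        diff = rollout_cost(k + delta * d) - rollout_cost(k - delta * d)
        grad_estimate = diff / (2.0 * delta) * d
        k -= step * grad_estimate       # descent step on the gain
    return k


k_final = zeroth_order_search()
print("initial cost:", rollout_cost(0.0))
print("final gain:", k_final, "final cost:", rollout_cost(k_final))
```

In this scalar setting the random sign cancels in the estimate, so the update reduces to a central finite difference. In higher dimensions the direction would be a random vector, and the estimate would be noisy, which is where the statistical questions raised above come in.
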
METHOD | TYPE |
---|---|
Hyperparameter Search | |