no code implementations • NeurIPS 2010 • Jeffrey Johns, Christopher Painter-Wakefield, Ronald Parr
We demonstrate that warm starts, as well as the efficiency of LCP solvers, can speed up policy iteration.
feature selection Reinforcement Learning (RL)