no code implementations • 19 Jan 2020 • Matthew Cockcroft, Shahil Mawjee, Steven James, Pravesh Ranchod
We present a method for learning options from segmented demonstration trajectories.
3 code implementations • 5 Sep 2015 • Warwick Masson, Pravesh Ranchod, George Konidaris
We introduce a model-free algorithm for learning in Markov decision processes with parameterized actions: discrete actions with continuous parameters.