no code implementations • 30 Nov 2023 • Bernd Frauenknecht, Tobias Ehlgen, Sebastian Trimpe
We find that in the case of trajectory control, the standard model-based RL formulation used in approaches like PETS-MPPI and MBPO is not suitable.
Autonomous Driving Q-Learning +2