1 code implementation • 7 Mar 2023 • Nick Bührer, Zhejun Zhang, Alexander Liniger, Fisher Yu, Luc van Gool
We propose a safe model-free reinforcement learning (RL) algorithm with a novel multiplicative value function consisting of a safety critic and a reward critic.
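One way to picture a multiplicative value function is as the product of the two critics' outputs. The sketch below is illustrative only and is not the authors' implementation. It assumes the safety critic outputs a probability in [0, 1] of remaining safe and the reward critic outputs a non-negative return estimate. The names `CriticOutputs` and `multiplicative_value` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CriticOutputs:
    """Hypothetical container for the two critic estimates at one state."""
    safety: float  # assumption: estimated probability in [0, 1] of staying safe
    reward: float  # assumption: non-negative return estimate ignoring constraints


def multiplicative_value(out: CriticOutputs) -> float:
    """Combine the two critics by multiplication (assumed form, not from the source)."""
    if not 0.0 <= out.safety <= 1.0:
        raise ValueError("safety estimate must lie in [0, 1]")
    if out.reward < 0.0:
        raise ValueError("this sketch assumes a non-negative reward estimate")
    # A low safety estimate scales the reward estimate down toward zero.
    return out.safety * out.reward


if __name__ == "__main__":
    risky = CriticOutputs(safety=0.25, reward=8.0)
    safe = CriticOutputs(safety=1.0, reward=8.0)
    print(multiplicative_value(risky), multiplicative_value(safe))
```

Under these assumptions, a state that is judged certainly safe keeps its full reward estimate, while a state judged certainly unsafe is valued at zero regardless of the reward it promises.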