no code implementations • 1 Nov 2023 • Jaafar Mhamed, Shangding Gu
In this study, we define the safety critic, a mechanism that nullifies rewards obtained through violating safety constraints.
Benchmarking reinforcement-learning +1