Search Results for author: Jaafar Mhamed

Found 1 paper, 0 papers with code

SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization

no code implementations • 1 Nov 2023 • Jaafar Mhamed, Shangding Gu

In this study, we define the safety critic, a mechanism that nullifies rewards obtained through violating safety constraints.
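As a rough illustration of what "nullifying" rewards could mean, the sketch below zeroes the reward at any step whose safety cost exceeds a threshold. This is only one simple reading of the sentence above, not the paper's actual safety critic: the function name `nullify_unsafe_rewards`, the per-step `costs` signal, and the `cost_threshold` parameter are all assumptions introduced here for illustration.

```python
from typing import List, Sequence


def nullify_unsafe_rewards(
    rewards: Sequence[float],
    costs: Sequence[float],
    cost_threshold: float = 0.0,  # hypothetical: any cost above this counts as a violation
) -> List[float]:
    """Return a copy of `rewards` with entries set to 0.0 wherever the
    matching step cost exceeds `cost_threshold`."""
    if len(rewards) != len(costs):
        raise ValueError("rewards and costs must have the same length")
    # Keep the reward for safe steps; drop it for steps that violated the constraint.
    return [0.0 if c > cost_threshold else r for r, c in zip(rewards, costs)]


# Example: the second step incurs a cost, so its reward is removed.
shaped = nullify_unsafe_rewards([1.0, 2.0, 0.5], [0.0, 1.0, 0.0])
print(shaped)  # [1.0, 0.0, 0.5]
```

Under this reading, an agent gains nothing from rewards collected during unsafe steps, which removes the incentive to trade safety for return.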

Benchmarking reinforcement-learning +1
