Search Results for author: Abhijit Mazumdar

Found 2 papers, 0 papers with code

Safe Reinforcement Learning for Constrained Markov Decision Processes with Stochastic Stopping Time

no code implementations23 Mar 2024 Abhijit Mazumdar, Rafal Wisniewski, Manuela L. Bujorianu

In this paper, we present an online reinforcement learning algorithm for constrained Markov decision processes with a safety constraint.

Efficient Exploration Safe Reinforcement Learning

Online Model-free Safety Verification for Markov Decision Processes Without Safety Violation

no code implementations8 Dec 2023 Abhijit Mazumdar, Rafal Wisniewski, Manuela L. Bujorianu

We then use an off-policy temporal difference learning method with importance sampling to learn the safety function corresponding to the given policy.

Cannot find the paper you are looking for? You can Submit a new open access paper.