1 code implementation • 9 Feb 2024 • Simone Parisi, Montaser Mohammedalamen, Alireza Kazemipour, Matthew E. Taylor, Michael Bowling
In this paper, we formalize a novel but general RL framework, Monitored MDPs, in which the agent cannot always observe rewards.
no code implementations • 29 Oct 2021 • Montaser Mohammedalamen, Dustin Morrill, Alexander Sieusahai, Yash Satsangi, Michael Bowling
An agent that could learn to be cautious would overcome this challenge by discovering for itself when and how to behave cautiously.
1 code implementation • 15 Jan 2019 • Montaser Mohammedalamen, Waleed D. Khamies, Benjamin Rosman
In this paper, we apply reinforcement learning (RL) techniques to train a realistic biomechanical model to work with different people and in different walking environments.