Search Results for author: Esra'a Saleh

Found 2 papers, 0 papers with code

Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration

no code implementations • 4 Jun 2022 • Dustin Morrill, Esra'a Saleh, Michael Bowling, Amy Greenwald

Neural replicator dynamics (NeuRD) is an alternative to the foundational softmax policy gradient (SPG) algorithm motivated by online learning and evolutionary game theory.

Decision Making

Paper
Add Code

Should Models Be Accurate?

no code implementations • 22 May 2022 • Esra'a Saleh, John D. Martin, Anna Koop, Arash Pourzarabi, Michael Bowling

We focus our investigations on Dyna-style planning in a prediction setting.

Meta-Learning Model-based Reinforcement Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.