Search Results for author: Stephen Zhao

Found 2 papers, 2 papers with code

Proximal Learning With Opponent-Learning Awareness

1 code implementation18 Oct 2022 Stephen Zhao, Chris Lu, Roger Baker Grosse, Jakob Nicolaus Foerster

This problem is especially pronounced in the opponent modeling setting, where the opponent's policy is unknown and must be inferred from observations; in such settings, LOLA is ill-specified because behaviorally equivalent opponent policies can result in non-equivalent updates.

Multi-agent Reinforcement Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.