no code implementations • 20 Feb 2023 • Jing Wang, Meichen Song, Feng Gao, Boyi Liu, Zhaoran Wang, Yi Wu
We initiate the study of how to perturb the reward in a zero-sum Markov game with two players to induce a desirable Nash equilibrium, namely arbitrating.
Multi-agent Reinforcement Learning reinforcement-learning +1