Search Results for author: Youzhi Zhang

Found 6 papers, 2 papers with code

Converging to Team-Maxmin Equilibria in Zero-Sum Multiplayer Games

no code implementations • ICML 2020 • Youzhi Zhang, Bo An

Second, we design an ISG variant for TMEs (ISGT) by exploiting that a TME is an NE maximizing the team’s utility and show that ISGT converges to a TME and the impossibility of relaxing conditions in ISGT.

Paper
Add Code

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems

1 code implementation • 19 Apr 2024 • Pengdeng Li, Shuxin Li, Xinrun Wang, Jakub Cerny, Youzhi Zhang, Stephen Mcaleer, Hau Chan, Bo An

Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks.

Graph Learning

Paper
Code

Offline Equilibrium Finding

1 code implementation • 12 Jul 2022 • Shuxin Li, Xinrun Wang, Youzhi Zhang, Jakub Cerny, Pengdeng Li, Hau Chan, Bo An

Extensive experimental results demonstrate the superiority of our approach over offline RL algorithms and the importance of using model-based methods for OEF problems.

Offline RL

Paper
Code

Solving Large-Scale Extensive-Form Network Security Games via Neural Fictitious Self-Play

no code implementations • 2 Jun 2021 • Wanqi Xue, Youzhi Zhang, Shuxin Li, Xinrun Wang, Bo An, Chai Kiat Yeo

Securing networked infrastructures is important in the real world.

Paper
Add Code

CFR-MIX: Solving Imperfect Information Extensive-Form Games with Combinatorial Action Space

no code implementations • 18 May 2021 • Shuxin Li, Youzhi Zhang, Xinrun Wang, Wanqi Xue, Bo An

The challenge of solving this type of game is that the team's joint action space grows exponentially with the number of agents, which results in the inefficiency of the existing algorithms, e. g., Counterfactual Regret Minimization (CFR).

counterfactual

Paper
Add Code

Learning Expensive Coordination: An Event-Based Deep RL Approach

no code implementations • ICLR 2020 • Zhenyu Shi*, Runsheng Yu*, Xinrun Wang*, Rundong Wang, Youzhi Zhang, Hanjiang Lai, Bo An

The main difficulties of expensive coordination are that i) the leader has to consider the long-term effect and predict the followers' behaviors when assigning bonuses and ii) the complex interactions between followers make the training process hard to converge, especially when the leader's policy changes with time.

Decision Making Multi-agent Reinforcement Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.