no code implementations • 3 Nov 2023 • Jinhang Zuo, Zhiyao Zhang, Xuchuang Wang, Cheng Chen, Shuai Li, John C. S. Lui, Mohammad Hajiesmaili, Adam Wierman
Cooperative multi-agent multi-armed bandits (CMA2B) consider the collaborative efforts of multiple agents in a shared multi-armed bandit game.
no code implementations • 8 Aug 2023 • Lin Yang, Xuchuang Wang, Mohammad Hajiesmaili, Lijun Zhang, John C. S. Lui, Don Towsley
Prior algorithms in both paradigms achieve the optimal group regret.
no code implementations • 15 Feb 2023 • Yu-Zhen Janice Chen, Lin Yang, Xuchuang Wang, Xutong Liu, Mohammad Hajiesmaili, John C. S. Lui, Don Towsley
We propose ODC, an on-demand communication protocol that tailors the communication of each pair of agents based on their empirical pull times.
no code implementations • 17 Jun 2022 • Xuchuang Wang, Hong Xie, John C. S. Lui
When the "per-load" reward follows a Gaussian distribution, we prove a sample complexity lower bound for learning the capacity from load-dependent rewards and also a regret lower bound for this new MP-MAB problem.
no code implementations • 28 Apr 2022 • Xuchuang Wang, Hong Xie, John C. S. Lui
The reward from a shareable arm is equal to the "per-load" reward multiplied by the minimum of the number of players pulling the arm and the arm's maximal shareable resources.
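The reward rule stated above can be sketched in a few lines of Python. This is only an illustration of the sentence as written, not the authors' code: the function name `shareable_arm_reward` and the parameter names `per_load_reward`, `num_players`, and `capacity` are hypothetical, and the per-load reward is treated as a single already-observed number rather than a random draw.

```python
def shareable_arm_reward(per_load_reward: float, num_players: int, capacity: int) -> float:
    """Reward of one shareable arm in one round.

    Hypothetical helper: `capacity` stands for the arm's maximal shareable
    resources, and `num_players` for the number of players pulling the arm.
    """
    if num_players < 0 or capacity < 0:
        raise ValueError("num_players and capacity must be non-negative")
    # Only min(num_players, capacity) units of load actually earn reward;
    # players beyond the capacity add nothing.
    effective_load = min(num_players, capacity)
    return per_load_reward * effective_load


# Three players on an arm with capacity 2: only two units of load are rewarded.
print(shareable_arm_reward(0.5, 3, 2))
# One player on the same arm: the capacity is not binding.
print(shareable_arm_reward(0.5, 1, 2))
```

Under this reading, sending more players to an arm than its capacity yields no extra reward, which is why the capacity has to be learned.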