Search Results for author: Ilai Bistritz

Found 8 papers, 0 papers with code

Equilibrium Bandits: Learning Optimal Equilibria of Unknown Dynamics

no code implementations27 Feb 2023 Siddharth Chandak, Ilai Bistritz, Nicholas Bambos

We prove that UECB achieves a regret of $\mathcal{O}(\log(T)+\tau_c\log(\tau_c)+\tau_c\log\log(T))$ for this equilibrium bandit problem where $\tau_c$ is the worst case approximate convergence time to equilibrium.

No Weighted-Regret Learning in Adversarial Bandits with Delays

no code implementations8 Mar 2021 Ilai Bistritz, Zhengyuan Zhou, Xi Chen, Nicholas Bambos, Jose Blanchet

Using these bounds, we show that FKM and EXP3 have no weighted-regret even for $d_{t}=O\left(t\log t\right)$.

Cooperative Multi-player Bandit Optimization

no code implementations NeurIPS 2020 Ilai Bistritz, Nicholas Bambos

At each turn, each player chooses an action and receives a reward that is an unknown function of all the players' actions.

Distributed Distillation for On-Device Learning

no code implementations NeurIPS 2020 Ilai Bistritz, Ariana Mann, Nicholas Bambos

We prove that our algorithm converges with probability 1 to a stationary point where all devices in the communication network distill the entire network's knowledge on the reference data, regardless of their local connections.

Online EXP3 Learning in Adversarial Bandits with Delayed Feedback

no code implementations NeurIPS 2019 Ilai Bistritz, Zhengyuan Zhou, Xi Chen, Nicholas Bambos, Jose Blanchet

An adversary chooses the cost of each arm in a bounded interval, and a sequence of feedback delays \left\{ d_{t}\right\} that are unknown to the player.

Do Informational Cascades Happen with Non-myopic Agents?

no code implementations3 May 2019 Ilai Bistritz, Nasimeh Heydaribeni, Achilleas Anastasopoulos

We provide a characterization of perfect Bayesian equilibria (PBE) with forward-looking strategies through a fixed-point equation of dimensionality that grows only quadratically with the number of players.

Distributed Learning for Channel Allocation Over a Shared Spectrum

no code implementations17 Feb 2019 S. M. Zafaruddin, Ilai Bistritz, Amir Leshem, Dusit Niyato

When the CSI is time varying and unknown to the users, the users face the challenge of both learning the channel statistics online and converge to a good channel allocation.

Distributed Multi-Player Bandits - a Game of Thrones Approach

no code implementations NeurIPS 2018 Ilai Bistritz, Amir Leshem

Each player has different expected rewards for the arms, and the instantaneous rewards are independent and identically distributed.

Cannot find the paper you are looking for? You can Submit a new open access paper.