no code implementations • 26 Jul 2023 • Behzad Nourani-Koliji, Steven Bilaj, Amir Rezaei Balef, Setareh Maghsudi
In our nonstationary environment, variations in the base arms' distributions, causal relationships between rewards, or both, change the reward generation process.
no code implementations • 25 Dec 2022 • Behzad Nourani-Koliji, Saeed Ghoorchian, Setareh Maghsudi
The objective is to maximize the long-term average payoff, which is a linear function of the base arms' rewards and depends strongly on the network topology.