no code implementations • 19 Oct 2023 • Alex Barbier-Chebbah, Christian L. Vestergaard, Jean-Baptiste Masson, Etienne Boursier
Built on this principle, we propose a new class of bandit algorithms that maximize an approximation to the information of a key variable within the system.
no code implementations • 4 Jul 2023 • Alex Barbier-Chebbah, Christian L. Vestergaard, Jean-Baptiste Masson
This paper addresses the exploration-exploitation dilemma inherent in decision-making, focusing on multi-armed bandit problems.
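To make the exploration-exploitation dilemma concrete, here is a minimal sketch of a Bernoulli multi-armed bandit played with epsilon-greedy, a standard textbook baseline. It is not the information-maximizing algorithm proposed in the papers listed above. The function name `run_epsilon_greedy`, the arm means, and all parameter values are illustrative assumptions.

```python
import random


def run_epsilon_greedy(true_means, n_steps=5000, epsilon=0.1, seed=0):
    """Play a Bernoulli bandit with epsilon-greedy (illustrative baseline only).

    true_means: hypothetical success probability of each arm (unknown to the agent).
    Returns per-arm pull counts, per-arm reward estimates, and the total reward.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    counts = [0] * n_arms        # number of times each arm was pulled
    estimates = [0.0] * n_arms   # running mean reward of each arm
    total_reward = 0

    for _ in range(n_steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest estimated mean.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Draw a Bernoulli reward from the chosen arm.
        reward = 1 if rng.random() < true_means[arm] else 0

        # Incremental update of the running mean for that arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total_reward += reward

    return counts, estimates, total_reward


# Assumed arm means, chosen only for illustration.
counts, estimates, total_reward = run_epsilon_greedy([0.2, 0.5, 0.8])
print(counts, [round(e, 3) for e in estimates], total_reward)
```

The `epsilon` parameter sets the trade-off directly: larger values spend more pulls gathering information about all arms, while smaller values commit earlier to the arm that currently looks best.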