no code implementations • 10 Feb 2024 • Yuriy Dorn, Aleksandr Katrutsa, Ilgam Latypov, Andrey Pudovikov
In this study, we propose a new method for constructing UCB-type algorithms for stochastic multi-armed bandits based on general convex optimization methods with an inexact oracle.