Search Results for author: Jonathan Louëdec

Found 2 papers, 0 papers with code

Be Greedy in Multi-Armed Bandits

no code implementations • 4 Jan 2021 • Matthieu Jedor, Jonathan Louëdec, Vianney Perchet

On the other hand, this heuristic performs reasonably well in practice and has sublinear, even near-optimal, regret bounds in some very specific linear contextual and Bayesian bandit models.

Multi-Armed Bandits
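
For readers unfamiliar with the heuristic mentioned in the excerpt above, the following is a minimal sketch of a generic greedy rule on a Bernoulli bandit: play each arm once, then always play the arm with the highest empirical mean. It is an illustration under stated assumptions, not necessarily the exact variant analyzed in the paper. The function name `greedy_bandit`, the arm means, the horizon, and the seed are all hypothetical choices made for this example.

```python
import random

def greedy_bandit(means, horizon, seed=0):
    """Run a plain greedy rule on a Bernoulli bandit (illustrative sketch).

    means   -- true success probability of each arm (assumed, for simulation only)
    horizon -- total number of rounds to play
    Returns (pulls per arm, cumulative pseudo-regret).
    """
    rng = random.Random(seed)
    k = len(means)
    pulls = [0] * k      # number of times each arm was played
    totals = [0.0] * k   # sum of rewards collected from each arm
    best = max(means)
    regret = 0.0
    for t in range(horizon):
        if t < k:
            arm = t  # initialization: play each arm once
        else:
            # pure exploitation: pick the arm with the highest empirical mean
            arm = max(range(k), key=lambda a: totals[a] / pulls[a])
        reward = 1.0 if rng.random() < means[arm] else 0.0
        pulls[arm] += 1
        totals[arm] += reward
        regret += best - means[arm]  # pseudo-regret uses the true means
    return pulls, regret

pulls, regret = greedy_bandit([0.3, 0.5, 0.7], horizon=1000)
print(pulls, round(regret, 1))
```

Because this rule never explores deliberately, an unlucky first draw can lock it onto a suboptimal arm, which is why its worst-case regret grows linearly with the horizon even though it often does well on a given run.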

Lifelong Learning in Multi-Armed Bandits

no code implementations • 28 Dec 2020 • Matthieu Jedor, Jonathan Louëdec, Vianney Perchet

Continuously learning and leveraging the knowledge accumulated from prior tasks in order to improve future performance is a long-standing machine learning problem.

Multi-Armed Bandits
