Search Results for author: Steven Bilaj

Found 3 papers, 0 papers with code

Meta Learning in Bandits within Shared Affine Subspaces

no code implementations • 31 Mar 2024 • Steven Bilaj, Sofien Dhouib, Setareh Maghsudi

We study the problem of meta-learning several contextual stochastic bandits tasks by leveraging their concentration around a low-dimensional affine subspace, which we learn via online principal component analysis to reduce the expected regret over the encountered bandits.

Meta-Learning Thompson Sampling

Paper
Add Code

Piecewise-Stationary Combinatorial Semi-Bandit with Causally Related Rewards

no code implementations • 26 Jul 2023 • Behzad Nourani-Koliji, Steven Bilaj, Amir Rezaei Balef, Setareh Maghsudi

In our nonstationary environment, variations in the base arms' distributions, causal relationships between rewards, or both, change the reward generation process.

Decision Making

Paper
Add Code

Hypothesis Transfer in Bandits by Weighted Models

no code implementations • 14 Nov 2022 • Steven Bilaj, Sofien Dhouib, Setareh Maghsudi

We consider the problem of contextual multi-armed bandits in the setting of hypothesis transfer learning.

Multi-Armed Bandits Transfer Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.