Search Results for author: Houssam Zenati

Found 12 papers, 8 papers with code

Covariance-Adaptive Least-Squares Algorithm for Stochastic Combinatorial Semi-Bandits

no code implementations23 Feb 2024 Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel

We address the problem of stochastic combinatorial semi-bandits, where a player can select from P subsets of a set containing d base items.

Sequential Counterfactual Risk Minimization

1 code implementation23 Feb 2023 Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard

Counterfactual Risk Minimization (CRM) is a framework for dealing with the logged bandit feedback problem, where the goal is to improve a logging policy using offline data.

counterfactual

Nested bandits

no code implementations19 Jun 2022 Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier, Houssam Zenati

In many online decision processes, the optimizing agent is called to choose between large numbers of alternatives with many inherent similarities; in turn, these similarities imply closely correlated losses that may confound standard discrete choice models and bandit algorithms.

Discrete Choice Models

Efficient Kernel UCB for Contextual Bandits

1 code implementation11 Feb 2022 Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard

While standard methods require a O(CT^3) complexity where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems.

Computational Efficiency Multi-Armed Bandits

Optimistic mirror descent in saddle-point problems: Going the extra(-gradient) mile

no code implementations ICLR 2019 Panayotis Mertikopoulos, Bruno Lecouat, Houssam Zenati, Chuan-Sheng Foo, Vijay Chandrasekhar, Georgios Piliouras

Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond.

Manifold regularization with GANs for semi-supervised learning

1 code implementation ICLR 2019 Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay Chandrasekhar

Generative Adversarial Networks are powerful generative models that are able to model the manifold of natural images.

Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

no code implementations7 Jul 2018 Panayotis Mertikopoulos, Bruno Lecouat, Houssam Zenati, Chuan-Sheng Foo, Vijay Chandrasekhar, Georgios Piliouras

Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond.

Semi-Supervised Learning with GANs: Revisiting Manifold Regularization

2 code implementations23 May 2018 Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay R. Chandrasekhar

GANS are powerful generative models that are able to model the manifold of natural images.

Cannot find the paper you are looking for? You can Submit a new open access paper.