Search Results for author: Elena Smirnova

Found 8 papers, 2 papers with code

On the Convergence of Smooth Regularized Approximate Value Iteration Schemes

no code implementations • NeurIPS 2020 • Elena Smirnova, Elvis Dohmatob

Entropy regularization, smoothing of Q-values and neural network function approximator are key components of the state-of-the-art reinforcement learning (RL) algorithms, such as Soft Actor-Critic~\cite{haarnoja2018soft}.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

On the Convergence of Approximate and Regularized Policy Iteration Schemes

no code implementations • 20 Sep 2019 • Elena Smirnova, Elvis Dohmatob

Entropy regularized algorithms such as Soft Q-learning and Soft Actor-Critic, recently showed state-of-the-art performance on a number of challenging reinforcement learning (RL) tasks.

Q-Learning Reinforcement Learning (RL)

Paper
Add Code

Distributionally Robust Counterfactual Risk Minimization

no code implementations • 14 Jun 2019 • Louis Faury, Ugo Tanielian, Flavian vasile, Elena Smirnova, Elvis Dohmatob

This manuscript introduces the idea of using Distributionally Robust Optimization (DRO) for the Counterfactual Risk Minimization (CRM) problem.

counterfactual Decision Making

Paper
Add Code

Distributionally Robust Reinforcement Learning

no code implementations • 23 Feb 2019 • Elena Smirnova, Elvis Dohmatob, Jérémie Mary

Our formulation results in a efficient algorithm that accounts for a simple re-weighting of policy actions in the standard policy iteration scheme.

Continuous Control Q-Learning +2

Paper
Add Code

Action-conditional Sequence Modeling for Recommendation

no code implementations • 7 Sep 2018 • Elena Smirnova

In this paper, we consider interactions triggered by the recommendations of deployed recommender system in addition to browsing behavior.

Collaborative Filtering Recommendation Systems

Paper
Add Code

Recurrent Neural Networks for Long and Short-Term Sequential Recommendation

no code implementations • 23 Jul 2018 • Kiewan Villatel, Elena Smirnova, Jérémie Mary, Philippe Preux

Recommender systems objectives can be broadly characterized as modeling user preferences over short-or long-term time horizon.

Dimensionality Reduction Sequential Recommendation +1

Paper
Add Code

Contextual Sequence Modeling for Recommendation with Recurrent Neural Networks

1 code implementation • 23 Jun 2017 • Elena Smirnova, Flavian vasile

Recommendations can greatly benefit from good representations of the user state at recommendation time.

Session-Based Recommendations

Paper
Code

Meta-Prod2Vec - Product Embeddings Using Side-Information for Recommendation

2 code implementations • 25 Jul 2016 • Flavian Vasile, Elena Smirnova, Alexis Conneau

We propose Meta-Prod2vec, a novel method to compute item similarities for recommendation that leverages existing item metadata.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.