Search Results for author: Enrique Munoz de Cote

Found 5 papers, 0 papers with code

Compatible features for Monotonic Policy Improvement

no code implementations9 Oct 2019 Marcin B. Tomczak, Sergio Valcarcel Macua, Enrique Munoz de Cote, Peter Vrancx

In this work we establish conditions under which the parametric approximation of the critic does not introduce bias to the updates of surrogate objective.

Adaptive Sensor Placement for Continuous Spaces

no code implementations16 May 2019 James A. Grant, Alexis Boukouvalas, Ryan-Rhys Griffiths, David S. Leslie, Sattar Vakili, Enrique Munoz de Cote

We consider the problem of adaptively placing sensors along an interval to detect stochastically-generated events.

Thompson Sampling

Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning

no code implementations28 Oct 2017 Sergio Valcarcel Macua, Aleksi Tukiainen, Daniel García-Ocaña Hernández, David Baldazo, Enrique Munoz de Cote, Santiago Zazo

We propose a fully distributed actor-critic algorithm approximated by deep neural networks, named \textit{Diff-DAC}, with application to single-task and to average multitask reinforcement learning (MRL).

reinforcement-learning Reinforcement Learning (RL)

A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity

no code implementations28 Jul 2017 Pablo Hernandez-Leal, Michael Kaisers, Tim Baarslag, Enrique Munoz de Cote

The key challenge in multiagent learning is learning a best response to the behaviour of other agents, which may be non-stationary: if the other agents adapt their strategy as well, the learning target moves.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.