Search Results for author: Pablo Hernandez-Leal

Found 12 papers, 2 papers with code

Robust Risk-Sensitive Reinforcement Learning Agents for Trading Markets

no code implementations • 16 Jul 2021 • Yue Gao, Kry Yik Chau Lui, Pablo Hernandez-Leal

Trading markets represent a real-world financial application to deploy reinforcement learning agents, however, they carry hard fundamental challenges such as high variance and costly exploration.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

CDT: Cascading Decision Trees for Explainable Reinforcement Learning

1 code implementation • 15 Nov 2020 • Zihan Ding, Pablo Hernandez-Leal, Gavin Weiguang Ding, Changjian Li, Ruitong Huang

As a second contribution our study reveals limitations of explaining black-box policies via imitation learning with tree-based explainable models, due to its inherent instability.

Explainable Models Imitation Learning +3

Paper
Code

Work in Progress: Temporally Extended Auxiliary Tasks

no code implementations • 1 Apr 2020 • Craig Sherstan, Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor

Our overall conclusions are that TD-AE increases the robustness of the A2C algorithm to the trajectory length and while promising, further study is required to fully understand the relationship between auxiliary task prediction timescale and the agent's performance.

Paper
Add Code

On Hard Exploration for Reinforcement Learning: a Case Study in Pommerman

no code implementations • 26 Jul 2019 • Chao Gao, Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor

In this paper, we illuminate reasons behind this failure by providing a thorough analysis on the hardness of random exploration in Pommerman.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Action Guidance with MCTS for Deep Reinforcement Learning

no code implementations • 25 Jul 2019 • Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor

Deep reinforcement learning has achieved great successes in recent years, however, one main challenge is the sample inefficiency.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Terminal Prediction as an Auxiliary Task for Deep Reinforcement Learning

no code implementations • 24 Jul 2019 • Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor

Deep reinforcement learning has achieved great successes in recent years, but there are still open challenges, such as convergence to locally optimal policies and sample inefficiency.

Atari Games reinforcement-learning +2

Paper
Add Code

Agent Modeling as Auxiliary Task for Deep Reinforcement Learning

no code implementations • 22 Jul 2019 • Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor

In this paper we explore how actor-critic methods in deep reinforcement learning, in particular Asynchronous Advantage Actor-Critic (A3C), can be extended with agent modeling.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition

1 code implementation • 20 Apr 2019 • Chao Gao, Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor

The Pommerman Team Environment is a recently proposed benchmark which involves a multi-agent domain with challenges such as partial observability, decentralized execution (without communication), and very sparse and delayed rewards.

Reinforcement Learning (RL)

Paper
Code

Safer Deep RL with Shallow MCTS: A Case Study in Pommerman

no code implementations • 10 Apr 2019 • Bilal Kartal, Pablo Hernandez-Leal, Chao GAO, Matthew E. Taylor

In this paper, we shed light into the reasons behind this failure by exemplifying and analyzing the high rate of catastrophic events (i. e., suicides) that happen under random exploration in this domain.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Using Monte Carlo Tree Search as a Demonstrator within Asynchronous Deep RL

no code implementations • 30 Nov 2018 • Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor

Deep reinforcement learning (DRL) has achieved great successes in recent years with the help of novel methods and higher compute power.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Survey and Critique of Multiagent Deep Reinforcement Learning

no code implementations • 12 Oct 2018 • Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor

The primary goal of this article is to provide a clear overview of current multiagent deep reinforcement learning (MDRL) literature.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

A Survey of Learning in Multiagent Environments: Dealing with Non-Stationarity

no code implementations • 28 Jul 2017 • Pablo Hernandez-Leal, Michael Kaisers, Tim Baarslag, Enrique Munoz de Cote

The key challenge in multiagent learning is learning a best response to the behaviour of other agents, which may be non-stationary: if the other agents adapt their strategy as well, the learning target moves.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.