Search Results for author: Rodrigo Toro Icarte

Found 13 papers, 8 papers with code

Learning Symbolic Representations for Reinforcement Learning of Non-Markovian Behavior

no code implementations8 Jan 2023 Phillip J. K. Christoffersen, Andrew C. Li, Rodrigo Toro Icarte, Sheila A. McIlraith

Recent work has leveraged Knowledge Representation (KR) to provide a symbolic abstraction of aspects of the state that summarize reward-relevant properties of the state-action history and support learning a Markovian decomposition of the problem in terms of an automaton over the KR.

reinforcement-learning Reinforcement Learning (RL)

Noisy Symbolic Abstractions for Deep RL: A case study with Reward Machines

no code implementations20 Nov 2022 Andrew C. Li, Zizhao Chen, Pashootan Vaezipoor, Toryn Q. Klassen, Rodrigo Toro Icarte, Sheila A. McIlraith

Natural and formal languages provide an effective mechanism for humans to specify instructions and reward functions.

Challenges to Solving Combinatorially Hard Long-Horizon Deep RL Tasks

1 code implementation3 Jun 2022 Andrew C. Li, Pashootan Vaezipoor, Rodrigo Toro Icarte, Sheila A. McIlraith

Deep reinforcement learning has shown promise in discrete domains requiring complex reasoning, including games such as Chess, Go, and Hanabi.

Learning Reward Machines: A Study in Partially Observable Reinforcement Learning

no code implementations17 Dec 2021 Rodrigo Toro Icarte, Ethan Waldie, Toryn Q. Klassen, Richard Valenzano, Margarita P. Castro, Sheila A. McIlraith

Here we show that RMs can be learned from experience, instead of being specified by the user, and that the resulting problem decomposition can be used to effectively solve partially observable RL problems.

Partially Observable Reinforcement Learning Problem Decomposition +2

Be Considerate: Objectives, Side Effects, and Deciding How to Act

no code implementations4 Jun 2021 Parand Alizadeh Alamdari, Toryn Q. Klassen, Rodrigo Toro Icarte, Sheila A. McIlraith

We endow RL agents with the ability to contemplate such impact by augmenting their reward based on expectation of future return by others in the environment, providing different criteria for characterizing impact.

Decision Making Reinforcement Learning (RL)

LTL2Action: Generalizing LTL Instructions for Multi-Task RL

1 code implementation13 Feb 2021 Pashootan Vaezipoor, Andrew Li, Rodrigo Toro Icarte, Sheila Mcilraith

We address the problem of teaching a deep reinforcement learning (RL) agent to follow instructions in multi-task environments.

Reinforcement Learning (RL)

Interpretable Sequence Classification via Discrete Optimization

1 code implementation6 Oct 2020 Maayan Shvo, Andrew C. Li, Rodrigo Toro Icarte, Sheila A. McIlraith

Our automata-based classifiers are interpretable---supporting explanation, counterfactual reasoning, and human-in-the-loop modification---and have strong empirical performance.

Classification counterfactual +4

Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning

3 code implementations6 Oct 2020 Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano, Sheila A. McIlraith

First, we propose reward machines, a type of finite state machine that supports the specification of reward functions while exposing reward function structure.

counterfactual Counterfactual Reasoning +3

Learning Reward Machines for Partially Observable Reinforcement Learning

1 code implementation NeurIPS 2019 Rodrigo Toro Icarte, Ethan Waldie, Toryn Klassen, Rick Valenzano, Margarita Castro, Sheila Mcilraith

Reward Machines (RMs), originally proposed for specifying problems in Reinforcement Learning (RL), provide a structured, automata-based representation of a reward function that allows an agent to decompose problems into subproblems that can be efficiently learned using off-policy learning.

Partially Observable Reinforcement Learning Problem Decomposition +2

Using Reward Machines for High-Level Task Specification and Decomposition in Reinforcement Learning

1 code implementation ICML 2018 Rodrigo Toro Icarte, Toryn Klassen, Richard Valenzano, Sheila Mcilraith

In this paper we propose Reward Machines {—} a type of finite state machine that supports the specification of reward functions while exposing reward function structure to the learner and supporting decomposition.

Hierarchical Reinforcement Learning Q-Learning +2

How a General-Purpose Commonsense Ontology can Improve Performance of Learning-Based Image Retrieval

1 code implementation24 May 2017 Rodrigo Toro Icarte, Jorge A. Baier, Cristian Ruz, Alvaro Soto

Consequently, a main conclusion of this work is that general-purpose commonsense ontologies improve performance on visual reasoning tasks when properly filtered to select meaningful visual relations.

Image Retrieval Retrieval +2

Cannot find the paper you are looking for? You can Submit a new open access paper.