Search Results for author: Matthew Schlegel

Found 10 papers, 1 papers with code

Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence

no code implementations27 Jan 2023 Lingwei Zhu, Zheng Chen, Matthew Schlegel, Martha White

Many policy optimization approaches in reinforcement learning incorporate a Kullback-Leilbler (KL) divergence to the previous policy, to prevent the policy from changing too quickly.

Atari Games reinforcement-learning +1

Meta-descent for Online, Continual Prediction

no code implementations17 Jul 2019 Andrew Jacobsen, Matthew Schlegel, Cameron Linke, Thomas Degris, Adam White, Martha White

This paper investigates different vector step-size adaptation approaches for non-stationary online, continual prediction problems.

Second-order methods Time Series +1

Importance Resampling for Off-policy Prediction

2 code implementations NeurIPS 2019 Matthew Schlegel, Wesley Chung, Daniel Graves, Jian Qian, Martha White

Importance sampling (IS) is a common reweighting strategy for off-policy prediction in reinforcement learning.

Context-Dependent Upper-Confidence Bounds for Directed Exploration

no code implementations NeurIPS 2018 Raksha Kumaraswamy, Matthew Schlegel, Adam White, Martha White

Directed exploration strategies for reinforcement learning are critical for learning an optimal policy in a minimal number of interactions with the environment.

Efficient Exploration

Importance Resampling for Off-policy Policy Evaluation

no code implementations27 Sep 2018 Matthew Schlegel, Wesley Chung, Daniel Graves, Martha White

We propose Importance Resampling (IR) for off-policy learning, that resamples experience from the replay buffer and applies a standard on-policy update.

General Value Function Networks

no code implementations18 Jul 2018 Matthew Schlegel, Andrew Jacobsen, Zaheer Abbas, Andrew Patterson, Adam White, Martha White

A general purpose strategy for state construction is to learn the state update using a Recurrent Neural Network (RNN), which updates the internal state using the current internal state and the most recent observation.

Continuous Control Decision Making

Discovery of Predictive Representations With a Network of General Value Functions

no code implementations ICLR 2018 Matthew Schlegel, Andrew Patterson, Adam White, Martha White

We investigate a framework for discovery: curating a large collection of predictions, which are used to construct the agent's representation of the world.

Decision Making

Adapting Kernel Representations Online Using Submodular Maximization

no code implementations ICML 2017 Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White

In this work, we develop an approximately submodular criterion for this setting, and an efficient online greedy submodular maximization algorithm for optimizing the criterion.

Continual Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.