no code implementations • 10 Sep 2022 • Alexander I. Cowen-Rivers, Philip John Gorinski, Aivar Sootla, Asif Khan, Liu Furui, Jun Wang, Jan Peters, Haitham Bou Ammar
Optimizing combinatorial structures is core to many real-world problems, such as those encountered in life sciences.
1 code implementation • 6 Jun 2022 • Aivar Sootla, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou Ammar
We further show that Simmer can stabilize training and improve the performance of safe RL with average constraints.
no code implementations • 31 May 2022 • David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang
In this paper, we introduce a reinforcement learning (RL) framework named Learnable Impulse Control Reinforcement Algorithm (LICRA) for learning to optimally select both when to act and which actions to take when actions incur costs.
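LICRA learns the act/no-act decision and the action jointly; the sketch below only illustrates the problem structure it targets, using a hand-written threshold rule in a made-up one-dimensional environment (all names, dynamics, and the fixed per-intervention cost are assumptions, not the paper's setup):

```python
import random

def step(state, act, action, cost_per_act=0.5):
    """One step of a toy costed-action environment: the state drifts
    away from 0; acting pulls it back but incurs a fixed cost."""
    if act:
        next_state = state + action                      # chosen correction
        reward = -abs(next_state) - cost_per_act
    else:
        next_state = state + random.uniform(-0.1, 0.3)   # uncontrolled drift
        reward = -abs(next_state)
    return next_state, reward

def threshold_policy(state, threshold=1.0):
    """Act only when the state drifts past a threshold (the 'when'),
    and if so push it back toward 0 (the 'which')."""
    act = abs(state) > threshold
    action = -state if act else 0.0
    return act, action

random.seed(0)
state, total = 2.0, 0.0
for _ in range(20):
    act, action = threshold_policy(state)
    state, reward = step(state, act, action)
    total += reward
```

The point of the setup is that acting too often wastes the per-action cost, while never acting lets the state (and hence the reward) deteriorate; a learned policy must trade the two off.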
no code implementations • 30 May 2022 • Changmin Yu, David Mguni, Dong Li, Aivar Sootla, Jun Wang, Neil Burgess
Efficient reinforcement learning (RL) involves a trade-off between "exploitative" actions that maximise expected reward and "explorative" ones that sample unvisited states.
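The textbook illustration of this trade-off (not the estimator proposed in the paper) is epsilon-greedy action selection, which mixes a small amount of random exploration into an otherwise greedy policy:

```python
import random

def epsilon_greedy(q_values, epsilon=0.1):
    """With probability epsilon take an explorative (random) action,
    otherwise the exploitative arg-max action."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])

random.seed(0)
picks = [epsilon_greedy([0.0, 1.0, 0.2], epsilon=0.1) for _ in range(1000)]
# Action 1 is greedy, so it is chosen with probability ~ 1 - epsilon + epsilon/3.
greedy_share = picks.count(1) / len(picks)
```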
1 code implementation • 14 Feb 2022 • Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar
Satisfying safety constraints almost surely (or with probability one) can be critical for the deployment of Reinforcement Learning (RL) in real-life applications.
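One common way to make such constraints visible to the policy is to augment the state with the remaining safety budget, so that exhausting the budget is an observable, terminal event; a minimal sketch of that augmentation (the environment, cost function, and penalty value below are all made up, and this is not claimed to be the paper's algorithm):

```python
def saute_step(state, budget, action, env_step, cost_fn, unsafe_penalty=-100.0):
    """Wrap an environment step so the remaining safety budget becomes
    part of the (augmented) state; running out of budget ends the
    episode with a large penalty."""
    next_state, reward = env_step(state, action)
    budget -= cost_fn(state, action)
    if budget <= 0.0:                      # constraint violated
        return (next_state, budget), unsafe_penalty, True
    return (next_state, budget), reward, False

# Toy dynamics: reward favours staying near 0, safety cost charges |action|.
env_step = lambda s, a: (s + a, -abs(s + a))
cost_fn = lambda s, a: abs(a)
```

Because the budget is part of the state, an ordinary RL algorithm trained on the wrapped environment can learn to avoid violations rather than merely penalising them in expectation.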
no code implementations • ICLR 2022 • Hang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar
We consider a context-dependent Reinforcement Learning (RL) setting, which is characterized by: a) an unknown finite number of not directly observable contexts; b) abrupt (discontinuous) context changes occurring during an episode; and c) Markovian context evolution.
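Properties (a)-(c) can be simulated directly: the hidden context is a finite index that evolves as a Markov chain and can switch abruptly mid-episode. A minimal sketch of that generative setting (not the paper's inference method; the two-context transition matrix is an assumption for illustration):

```python
import random

def sample_context_path(transition, start, steps, rng):
    """Simulate an episode's hidden context sequence: a finite,
    unobserved index evolving as a Markov chain, so abrupt
    (discontinuous) switches can occur at any step."""
    path, ctx = [start], start
    for _ in range(steps - 1):
        r, cum = rng.random(), 0.0
        for nxt, p in enumerate(transition[ctx]):
            cum += p
            if r < cum:
                ctx = nxt
                break
        path.append(ctx)
    return path

# Two 'sticky' contexts: 95% chance of staying, 5% of an abrupt switch.
T = [[0.95, 0.05], [0.05, 0.95]]
path = sample_context_path(T, start=0, steps=50, rng=random.Random(0))
```

The agent only observes states and rewards whose dynamics depend on `path`, never `path` itself, which is what makes the setting a challenge for standard RL.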
no code implementations • 27 Oct 2021 • David Mguni, Usman Islam, Yaqi Sun, Xiuling Zhang, Joel Jennings, Aivar Sootla, Changmin Yu, Ziyan Wang, Jun Wang, Yaodong Yang
In this paper, we introduce a new generation of RL solvers that learn to minimise safety violations while maximising the task reward to the extent that can be tolerated by the safe policy.
no code implementations • 6 Jul 2021 • Vincent Moens, Aivar Sootla, Haitham Bou Ammar, Jun Wang
We present a method for conditional sampling for pre-trained normalizing flows when only part of an observation is available.
no code implementations • 10 Oct 2020 • Thomas Tanay, Aivar Sootla, Matteo Maggioni, Puneet K. Dokania, Philip Torr, Ales Leonardis, Gregory Slabaugh
Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-resolution.
1 code implementation • 12 Jun 2020 • Alexander I. Cowen-Rivers, Daniel Palenicek, Vincent Moens, Mohammed Abdullah, Aivar Sootla, Jun Wang, Haitham Ammar
In this paper, we propose SAMBA, a novel framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics.
no code implementations • 28 Mar 2014 • Wei Pan, Aivar Sootla, Guy-Bart Stan
In this paper, we present a distributed algorithm for the reconstruction of large-scale nonlinear networks.
no code implementations • 12 Mar 2013 • Aivar Sootla, Natalja Strelkowa, Damien Ernst, Mauricio Barahona, Guy-Bart Stan
In this paper, we consider the problem of optimal exogenous control of gene regulatory networks.