Search Results for author: Patrick Wienhöft

Found 3 papers, 1 paper with code

More for Less: Safe Policy Improvement With Stronger Performance Guarantees

1 code implementation • 13 May 2023 • Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen

In an offline reinforcement learning setting, the safe policy improvement (SPI) problem aims to improve the performance of a behavior policy according to which sample data has been generated.

Strategy Synthesis in Markov Decision Processes Under Limited Sampling Access

no code implementations • 22 Mar 2023 • Christel Baier, Clemens Dubslaff, Patrick Wienhöft, Stefan J. Kiebel

A central task in control theory, artificial intelligence, and formal methods is to synthesize reward-maximizing strategies for agents that operate in partially unknown environments.

Novel Concepts • reinforcement-learning
