no code implementations • 8 Apr 2024 • Tobias Meggendorfer, Maximilian Weininger, Patrick Wienhöft
Markov decision processes (MDPs) are a fundamental model for decision making under uncertainty.
1 code implementation • 13 May 2023 • Patrick Wienhöft, Marnix Suilen, Thiago D. Simão, Clemens Dubslaff, Christel Baier, Nils Jansen
In an offline reinforcement learning setting, the safe policy improvement (SPI) problem aims to improve the performance of a behavior policy according to which sample data has been generated.
no code implementations • 22 Mar 2023 • Christel Baier, Clemens Dubslaff, Patrick Wienhöft, Stefan J. Kiebel
A central task in control theory, artificial intelligence, and formal methods is to synthesize reward-maximizing strategies for agents that operate in partially unknown environments.