no code implementations • 29 Jun 2023 • Martin Tappler, Edi Muškardin, Bernhard K. Aichernig, Bettina Könighofer
We aim to provide insights into the decisions faced by the agent by learning an automaton model of environmental behavior under the control of an agent.
1 code implementation • 29 Jun 2023 • Edi Muškardin, Martin Tappler, Ingo Pill, Bernhard K. Aichernig, Thomas Pock
We examine the assumption that the hidden-state vectors of recurrent neural networks (RNNs) tend to form clusters of semantically similar vectors, which we dub the clustering hypothesis.
1 code implementation • 4 Dec 2022 • Martin Tappler, Stefan Pranger, Bettina Könighofer, Edi Muškardin, Roderick Bloem, Kim Larsen
Iteratively, we use the collected data to learn new MDPs with higher accuracy, resulting in turn in shields able to prevent more safety violations.