no code implementations • 11 Apr 2024 • Stefan Sylvius Wagner, Maike Behrendt, Marc Ziegele, Stefan Harmeling
In this work, we present two different ways to leverage LLM-generated synthetic data to train and improve stance detection agents for online political discussions: first, we show that augmenting a small fine-tuning dataset with synthetic data can improve the performance of the stance detection model.
1 code implementation • 3 Apr 2024 • Maike Behrendt, Stefan Sylvius Wagner, Marc Ziegele, Lena Wilms, Anke Stoll, Dominique Heinbach, Stefan Harmeling
In this work, we introduce AQuA, an additive score that calculates a unified deliberative quality score from multiple indices for each discussion post.
no code implementations • 5 Feb 2024 • Stefan Sylvius Wagner, Stefan Harmeling
In this paper we adopt a representation-centric perspective on exploration in reinforcement learning, viewing exploration fundamentally as a density estimation problem.
no code implementations • 30 Aug 2023 • Stefan Sylvius Wagner, Peter Arndt, Jan Robine, Stefan Harmeling
In environments with sparse rewards, finding a good inductive bias for exploration is crucial to the agent's success.
2 code implementations • 22 Aug 2023 • Tobias Uelwer, Jan Robine, Stefan Sylvius Wagner, Marc Höftmann, Eric Upschulte, Sebastian Konietzny, Maike Behrendt, Stefan Harmeling
Learning meaningful representations is at the heart of many tasks in the field of modern machine learning.