no code implementations • 7 Jan 2024 • Robert Müller, Hasan Turalic, Thomy Phan, Michael Kölle, Jonas Nüßlein, Claudia Linnhoff-Popien
In the realm of Multi-Agent Reinforcement Learning (MARL), prevailing approaches exhibit shortcomings in aligning with human learning, robustness, and scalability.
no code implementations • 14 Dec 2023 • Michael Kölle, Afrae Ahouzi, Pascal Debus, Robert Müller, Danielle Schuman, Claudia Linnhoff-Popien
Quantum computing, with its potential to enhance various machine learning tasks, allows significant advancements in kernel calculation and model precision.
1 code implementation • 20 Jul 2023 • Jonas Stein, Ivo Christ, Nicolas Kraus, Maximilian Balthasar Mansky, Robert Müller, Claudia Linnhoff-Popien
As an application domain where the slightest qualitative improvements can yield immense value, finance is a promising candidate for early quantum advantage.
no code implementations • 9 Jun 2023 • Michael Kölle, Alessandro Giovagnoli, Jonas Stein, Maximilian Balthasar Mansky, Julian Hager, Tobias Rohe, Robert Müller, Claudia Linnhoff-Popien
Inspired by the remarkable success of artificial neural networks across a broad spectrum of AI tasks, variational quantum circuits (VQCs) have recently seen an upsurge in quantum machine learning applications.
no code implementations • 20 Dec 2022 • Steffen Illium, Thore Schillman, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien
Common to all different kinds of recurrent neural networks (RNNs) is the intention to model relations between data points through time.
no code implementations • 20 Dec 2022 • Steffen Illium, Robert Müller, Andreas Sedlmeier, Claudia-Linnhoff Popien
We apply the vision transformer, a deep machine learning model build around the attention mechanism, on mel-spectrogram representations of raw audio recordings.
no code implementations • 15 Jul 2022 • Kyrill Schmid, Lenz Belzner, Robert Müller, Johannes Tochtermann, Claudia Linnhoff-Popien
Some of the most relevant future applications of multi-agent systems like autonomous driving or factories as a service display mixed-motive scenarios, where agents might have conflicting goals.
1 code implementation • 12 Jun 2022 • Jonas Nüßlein, Steffen Illium, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien
As a prior, we assume that the higher-level strategy is to reach an unknown target state area, which we hypothesize is a valid prior for many domains in Reinforcement Learning.
no code implementations • 13 Mar 2022 • Bilal Shaikh, Lucian P. Smith, Dan Vasilescu, Gnaneswara Marupilla, Michael Wilson, Eran Agmon, Henry Agnew, Steven S. Andrews, Azraf Anwar, Moritz E. Beber, Frank T. Bergmann, David Brooks, Lutz Brusch, Laurence Calzone, Kiri Choi, Joshua Cooper, John Detloff, Brian Drawert, Michel Dumontier, G. Bard Ermentrout, James R. Faeder, Andrew P. Freiburger, Fabian Fröhlich, Akira Funahashi, Alan Garny, John H. Gennari, Padraig Gleeson, Anne Goelzer, Zachary Haiman, Joseph L. Hellerstein, Stefan Hoops, Jon C. Ison, Diego Jahn, Henry V. Jakubowski, Ryann Jordan, Matúš Kalaš, Matthias König, Wolfram Liebermeister, Synchon Mandal, Robert McDougal, J. Kyle Medley, Pedro Mendes, Robert Müller, Chris J. Myers, Aurelien Naldi, Tung V. N. Nguyen, David P. Nickerson, Brett G. Olivier, Drashti Patoliya, Loïc Paulevé, Linda R. Petzold, Ankita Priya, Anand K. Rampadarath, Johann M. Rohwer, Ali S. Saglam, Dilawar Singh, Ankur Sinha, Jacky Snoep, Hugh Sorby, Ryan Spangler, Jörn Starruß, Payton J. Thomas, David van Niekerk, Daniel Weindl, Fengkai Zhang, Anna Zhukova, Arthur P. Goldberg, Michael L. Blinov, Herbert M. Sauro, Ion I. Moraru, Jonathan R. Karr
To help investigators find and use simulation tools, we developed BioSimulators (https://biosimulators. org), a central registry of the capabilities of simulation tools and consistent Python, command-line, and containerized interfaces to each version of each tool.
no code implementations • 21 Jan 2022 • Robert Müller, Aldo Pacchiano
We study meta-learning in Markov Decision Processes (MDP) with linear transition models in the undiscounted episodic setting.
no code implementations • 14 Dec 2021 • Andreas Sedlmeier, Michael Kölle, Robert Müller, Leo Baudrexel, Claudia Linnhoff-Popien
In this work, we analyze existing and propose new metrics for the detection and quantification of multimodal uncertainty in RL based World Models.
no code implementations • 14 Dec 2020 • Fabian Ritz, Thomy Phan, Robert Müller, Thomas Gabor, Andreas Sedlmeier, Marc Zeller, Jan Wieghardt, Reiner Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien
A characteristic of reinforcement learning is the ability to develop unforeseen strategies when solving problems.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 11 Dec 2020 • Robert Müller, Steffen Illium, Fabian Ritz, Kyrill Schmid
In this work, we thoroughly evaluate the efficacy of pretrained neural networks as feature extractors for anomalous sound detection.
no code implementations • 11 Dec 2020 • Robert Müller, Steffen Illium, Fabian Ritz, Tobias Schröder, Christian Platschek, Jörg Ochs, Claudia Linnhoff-Popien
In this work, we present a general procedure for acoustic leak detection in water networks that satisfies multiple real-world constraints such as energy efficiency and ease of deployment.
no code implementations • 11 Aug 2020 • Steffen Illium, Robert Müller, Andreas Sedlmeier, Claudia Linnhoff-Popien
In many fields of research, labeled datasets are hard to acquire.
no code implementations • ICML Workshop LifelongML 2020 • Robert Müller, Jack Parker-Holder, Aldo Pacchiano
Meta-learning is a paradigm whereby an agent is trained with the specific goal of fast adaptation.
no code implementations • 5 Jun 2020 • Robert Müller, Fabian Ritz, Steffen Illium, Claudia Linnhoff-Popien
In industrial applications, the early detection of malfunctioning factory machinery is crucial.
no code implementations • 25 May 2020 • Andreas Sedlmeier, Robert Müller, Steffen Illium, Claudia Linnhoff-Popien
One critical prerequisite for the deployment of reinforcement learning systems in the real world is the ability to reliably detect situations on which the agent was not trained.
no code implementations • 5 Aug 2019 • Stefan Langer, Robert Müller, Kyrill Schmid, Claudia Linnhoff-Popien
The difficulty of mountainbike downhill trails is a subjective perception.
no code implementations • 30 Jul 2019 • Robert Müller, Stefan Langer, Fabian Ritz, Christoph Roch, Steffen Illium, Claudia Linnhoff-Popien
In this work we present STEVE - Soccer TEam VEctors, a principled approach for learning real valued vectors for soccer teams where similar teams are close to each other in the resulting vector space.
1 code implementation • 11 Jul 2019 • Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, Claudia Linnhoff-Popien
We propose Stable Yet Memory Bounded Open-Loop (SYMBOL) planning, a general memory bounded approach to partially observable open-loop planning.
no code implementations • 5 Jul 2019 • Daniel Elsner, Stefan Langer, Fabian Ritz, Robert Müller, Steffen Illium
Detecting sleepiness from spoken language is an ambitious task, which is addressed by the Interspeech 2019 Computational Paralinguistics Challenge (ComParE).