no code implementations • 31 Jul 2021 • Mark Rucker, Stephen Adams, Roy Hayes, Peter A. Beling
In this paper, the recovered rewards are visually displayed, clustered using unsupervised learning, and classified using a supervised learner.
no code implementations • 2 Jul 2021 • Tyler Cody, Stephen Adams, Peter A. Beling
We consider the use of transfer distance in the design of machine rebuild procedures to allow for transferable prognostic models.
1 code implementation • 24 Jul 2020 • Jianyu Su, Stephen Adams, Peter A. Beling
To obtain a reasonable trade-off between training efficiency and algorithm performance, we extend value-decomposition to actor-critics that are compatible with A2C and propose a novel actor-critic framework, value-decomposition actor-critics (VDACs).
Multi-agent Reinforcement Learning, Reinforcement Learning (RL)
no code implementations • 1 Apr 2020 • Jianyu Su, Stephen Adams, Peter A. Beling
The flexibility of the graph structure makes our method applicable to a variety of multi-agent systems, e.g., dynamic systems that consist of varying numbers of agents and static systems with a fixed number of agents.