no code implementations • 9 Apr 2022 • Ryohei Oura, Toshimitsu Ushio
In the product of the SMDP and the deterministic $K$-co-B\"uchi automaton (d$K$cBA) converted from the LTL specification, we learn both the winning region of satisfying the LTL specification and the dynamics therein based on reinforcement learning and Bayesian inference.
no code implementations • 21 Jan 2022 • Junya Ikemoto, Toshimitsu Ushio
Deep reinforcement learning (DRL) has attracted much attention as an approach to solve optimal control problems without mathematical models of systems.
no code implementations • 3 Aug 2021 • Junya Ikemoto, Toshimitsu Ushio
We apply deep reinforcement learning (DRL) to design of a networked controller with network delays to complete a temporal control task that is described by a signal temporal logic (STL) formula.
no code implementations • 20 Jul 2021 • Kazumune Hashimoto, Natsuko Tsumagari, Toshimitsu Ushio
An exploration policy for the copter is then synthesized by employing the notion of an entropy that is evaluated based on the environmental beliefs of the atomic propositions, and a path that the rover intends to follow according to the optimal policy.
no code implementations • 7 May 2021 • Ryohei Oura, Toshimitsu Ushio, Ami Sakakibara
We compute a winning region and a directed controller with the maximum satisfaction probability using a dynamic programming method, where the expected return is used as a value function.
1 code implementation • 13 Jan 2021 • Junya Ikemoto, Toshimitsu Ushio
If we know a mathematical model of a real system, a simulator is useful because it predicates behaviors of the real system using the mathematical model with a given system parameter vector.
no code implementations • 4 Apr 2020 • Kazumune Hashimoto, Adnane Saoud, Masako Kishida, Toshimitsu Ushio, Dimos Dimarogonas
Symbolic models or abstractions are known to be powerful tools for the control design of cyber-physical systems (CPSs) with logic specifications.
no code implementations • 14 Jan 2020 • Ryohei Oura, Ami Sakakibara, Toshimitsu Ushio
This letter proposes a novel reinforcement learning method for the synthesis of a control policy satisfying a control specification described by a linear temporal logic formula.
no code implementations • 28 Aug 2019 • Junya Ikemoto, Toshimitsu Ushio
In this paper, we propose a design of a model-free networked controller for a nonlinear plant whose mathematical model is unknown.
no code implementations • 16 Jul 2019 • Junya Ikemoto, Toshimitsu Ushio
Moreover, model-free reinforcement learning algorithms with deep neural networks have the disadvantage in taking much time to learn their control optimal policies.