Search Results for author: Toshimitsu Ushio

Found 10 papers, 1 papers with code

Learning-based Bounded Synthesis for Semi-MDPs with LTL Specifications

no code implementations9 Apr 2022 Ryohei Oura, Toshimitsu Ushio

In the product of the SMDP and the deterministic $K$-co-B\"uchi automaton (d$K$cBA) converted from the LTL specification, we learn both the winning region of satisfying the LTL specification and the dynamics therein based on reinforcement learning and Bayesian inference.

Bayesian Inference

Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation

no code implementations21 Jan 2022 Junya Ikemoto, Toshimitsu Ushio

Deep reinforcement learning (DRL) has attracted much attention as an approach to solve optimal control problems without mathematical models of systems.

Decision Making reinforcement-learning +1

Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications

no code implementations3 Aug 2021 Junya Ikemoto, Toshimitsu Ushio

We apply deep reinforcement learning (DRL) to design of a networked controller with network delays to complete a temporal control task that is described by a signal temporal logic (STL) formula.

Reinforcement Learning (RL)

Collaborative rover-copter path planning and exploration with temporal logic specifications based on Bayesian update under uncertain environments

no code implementations20 Jul 2021 Kazumune Hashimoto, Natsuko Tsumagari, Toshimitsu Ushio

An exploration policy for the copter is then synthesized by employing the notion of an entropy that is evaluated based on the environmental beliefs of the atomic propositions, and a path that the rover intends to follow according to the optimal policy.

Bounded Synthesis and Reinforcement Learning of Supervisors for Stochastic Discrete Event Systems with LTL Specifications

no code implementations7 May 2021 Ryohei Oura, Toshimitsu Ushio, Ami Sakakibara

We compute a winning region and a directed controller with the maximum satisfaction probability using a dynamic programming method, where the expected return is used as a value function.

Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems

1 code implementation13 Jan 2021 Junya Ikemoto, Toshimitsu Ushio

If we know a mathematical model of a real system, a simulator is useful because it predicates behaviors of the real system using the mathematical model with a given system parameter vector.

Q-Learning Reinforcement Learning (RL)

Learning-based Symbolic Abstractions for Nonlinear Control Systems

no code implementations4 Apr 2020 Kazumune Hashimoto, Adnane Saoud, Masako Kishida, Toshimitsu Ushio, Dimos Dimarogonas

Symbolic models or abstractions are known to be powerful tools for the control design of cyber-physical systems (CPSs) with logic specifications.

Safe Exploration

Reinforcement Learning of Control Policy for Linear Temporal Logic Specifications Using Limit-Deterministic Generalized Büchi Automata

no code implementations14 Jan 2020 Ryohei Oura, Ami Sakakibara, Toshimitsu Ushio

This letter proposes a novel reinforcement learning method for the synthesis of a control policy satisfying a control specification described by a linear temporal logic formula.

Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning

no code implementations28 Aug 2019 Junya Ikemoto, Toshimitsu Ushio

In this paper, we propose a design of a model-free networked controller for a nonlinear plant whose mathematical model is unknown.

Q-Learning

Model-free Control of Chaos with Continuous Deep Q-learning

no code implementations16 Jul 2019 Junya Ikemoto, Toshimitsu Ushio

Moreover, model-free reinforcement learning algorithms with deep neural networks have the disadvantage in taking much time to learn their control optimal policies.

Q-Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.