Search Results for author: Toshimitsu Ushio

Found 10 papers, 1 papers with code

Learning-based Bounded Synthesis for Semi-MDPs with LTL Specifications

no code implementations • 9 Apr 2022 • Ryohei Oura, Toshimitsu Ushio

In the product of the SMDP and the deterministic $K$-co-B\"uchi automaton (d$K$cBA) converted from the LTL specification, we learn both the winning region of satisfying the LTL specification and the dynamics therein based on reinforcement learning and Bayesian inference.

Bayesian Inference

Paper
Add Code

Deep reinforcement learning under signal temporal logic constraints using Lagrangian relaxation

no code implementations • 21 Jan 2022 • Junya Ikemoto, Toshimitsu Ushio

Deep reinforcement learning (DRL) has attracted much attention as an approach to solve optimal control problems without mathematical models of systems.

Decision Making reinforcement-learning +1

Paper
Add Code

Deep Reinforcement Learning Based Networked Control with Network Delays for Signal Temporal Logic Specifications

no code implementations • 3 Aug 2021 • Junya Ikemoto, Toshimitsu Ushio

We apply deep reinforcement learning (DRL) to design of a networked controller with network delays to complete a temporal control task that is described by a signal temporal logic (STL) formula.

Reinforcement Learning (RL)

Paper
Add Code

Collaborative rover-copter path planning and exploration with temporal logic specifications based on Bayesian update under uncertain environments

no code implementations • 20 Jul 2021 • Kazumune Hashimoto, Natsuko Tsumagari, Toshimitsu Ushio

An exploration policy for the copter is then synthesized by employing the notion of an entropy that is evaluated based on the environmental beliefs of the atomic propositions, and a path that the rover intends to follow according to the optimal policy.

Paper
Add Code

Bounded Synthesis and Reinforcement Learning of Supervisors for Stochastic Discrete Event Systems with LTL Specifications

no code implementations • 7 May 2021 • Ryohei Oura, Toshimitsu Ushio, Ami Sakakibara

We compute a winning region and a directed controller with the maximum satisfaction probability using a dynamic programming method, where the expected return is used as a value function.

Paper
Add Code

Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems

1 code implementation • 13 Jan 2021 • Junya Ikemoto, Toshimitsu Ushio

If we know a mathematical model of a real system, a simulator is useful because it predicates behaviors of the real system using the mathematical model with a given system parameter vector.

Q-Learning Reinforcement Learning (RL)

Paper
Code

Learning-based Symbolic Abstractions for Nonlinear Control Systems

no code implementations • 4 Apr 2020 • Kazumune Hashimoto, Adnane Saoud, Masako Kishida, Toshimitsu Ushio, Dimos Dimarogonas

Symbolic models or abstractions are known to be powerful tools for the control design of cyber-physical systems (CPSs) with logic specifications.

Safe Exploration

Paper
Add Code

Reinforcement Learning of Control Policy for Linear Temporal Logic Specifications Using Limit-Deterministic Generalized Büchi Automata

no code implementations • 14 Jan 2020 • Ryohei Oura, Ami Sakakibara, Toshimitsu Ushio

This letter proposes a novel reinforcement learning method for the synthesis of a control policy satisfying a control specification described by a linear temporal logic formula.

Paper
Add Code

Networked Control of Nonlinear Systems under Partial Observation Using Continuous Deep Q-Learning

no code implementations • 28 Aug 2019 • Junya Ikemoto, Toshimitsu Ushio

In this paper, we propose a design of a model-free networked controller for a nonlinear plant whose mathematical model is unknown.

Q-Learning

Paper
Add Code

Model-free Control of Chaos with Continuous Deep Q-learning

no code implementations • 16 Jul 2019 • Junya Ikemoto, Toshimitsu Ushio

Moreover, model-free reinforcement learning algorithms with deep neural networks have the disadvantage in taking much time to learn their control optimal policies.

Q-Learning reinforcement-learning +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.