Search Results for author: Ryuki Tachibana

Found 18 papers, 4 papers with code

Ensemble of Discriminators for Domain Adaptation in Multiple Sound Source 2D Localization

no code implementations • 10 Dec 2020 • Guillaume Le Moing, Don Joven Agravante, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana, Phongtharin Vinayavekhin

This paper introduces an ensemble of discriminators that improves the accuracy of a domain adaptation technique for the localization of multiple sound sources.

Domain Adaptation

Paper
Add Code

Learning Multiple Sound Source 2D Localization

no code implementations • 10 Dec 2020 • Guillaume Le Moing, Phongtharin Vinayavekhin, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana, Don Joven Agravante

In this paper, we propose novel deep learning based algorithms for multiple sound source localization.

Paper
Add Code

Data-Efficient Framework for Real-world Multiple Sound Source 2D Localization

no code implementations • 10 Dec 2020 • Guillaume Le Moing, Phongtharin Vinayavekhin, Don Joven Agravante, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana

Moreover, learning for different microphone array layouts makes the task more complicated due to the infinite number of possible layouts.

Paper
Add Code

Q-learning with Language Model for Edit-based Unsupervised Summarization

1 code implementation • EMNLP 2020 • Ryosuke Kohita, Akifumi Wachi, Yang Zhao, Ryuki Tachibana

Q-learning is leveraged to train the agent to produce proper edit actions.

Abstractive Text Summarization Language Modelling +1

Paper
Code

Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games

1 code implementation • EMNLP 2020 • Subhajit Chaudhury, Daiki Kimura, Kartik Talamadupula, Michiaki Tatsubori, Asim Munawar, Ryuki Tachibana

Our bootstrapped agent shows improved generalization in solving unseen TextWorld games, using 10x-20x fewer training games compared to previous state-of-the-art methods despite requiring less number of training episodes.

Q-Learning Reinforcement Learning (RL) +1

Paper
Code

Unsupervised Temporal Feature Aggregation for Event Detection in Unstructured Sports Videos

no code implementations • 19 Feb 2020 • Subhajit Chaudhury, Daiki Kimura, Phongtharin Vinayavekhin, Asim Munawar, Ryuki Tachibana, Koji Ito, Yuki Inaba, Minoru Matsumoto, Shuji Kidokoro, Hiroki Ozaki

In this paper, we study the case of event detection in sports videos for unstructured environments with arbitrary camera angles.

Data Augmentation Event Detection +3

Paper
Add Code

Spatially-weighted Anomaly Detection with Regression Model

no code implementations • 23 Mar 2019 • Daiki Kimura, Minori Narita, Asim Munawar, Ryuki Tachibana

Visual anomaly detection is common in several applications including medical screening and production quality check.

Anomaly Detection regression

Paper
Add Code

Spatially-weighted Anomaly Detection

no code implementations • 5 Oct 2018 • Minori Narita, Daiki Kimura, Ryuki Tachibana

Many types of anomaly detection methods have been proposed recently, and applied to a wide variety of fields including medical screening and production quality checking.

Anomaly Detection

Paper
Add Code

Injective State-Image Mapping facilitates Visual Adversarial Imitation Learning

no code implementations • 2 Oct 2018 • Subhajit Chaudhury, Daiki Kimura, Asim Munawar, Ryuki Tachibana

Experimental results show that the proposed adversarial learning method from raw videos produces a similar performance to state-of-the-art imitation learning techniques while frequently outperforming existing hand-crafted video imitation methods.

Imitation Learning

Paper
Add Code

Constrained Exploration and Recovery from Experience Shaping

1 code implementation • 21 Sep 2018 • Tu-Hoa Pham, Giovanni De Magistris, Don Joven Agravante, Subhajit Chaudhury, Asim Munawar, Ryuki Tachibana

We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding undesirable actions or states, associated to lower rewards, or penalties.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Reinforcement Learning Testbed for Power-Consumption Optimization

1 code implementation • 21 Aug 2018 • Takao Moriyama, Giovanni De Magistris, Michiaki Tatsubori, Tu-Hoa Pham, Asim Munawar, Ryuki Tachibana

Common approaches to control a data-center cooling system rely on approximated system/environment models that are built upon the knowledge of mechanical cooling and electrical and thermal management.

Systems and Control

171

Paper
Code

Focusing on What is Relevant: Time-Series Learning and Understanding using Attention

no code implementations • 22 Jun 2018 • Phongtharin Vinayavekhin, Subhajit Chaudhury, Asim Munawar, Don Joven Agravante, Giovanni De Magistris, Daiki Kimura, Ryuki Tachibana

This paper is a contribution towards interpretability of the deep learning models in different applications of time-series.

General Classification Time Series +1

Paper
Add Code

Internal Model from Observations for Reward Shaping

no code implementations • 2 Jun 2018 • Daiki Kimura, Subhajit Chaudhury, Ryuki Tachibana, Sakyasingha Dasgupta

During reinforcement learning, the agent predicts the reward as a function of the difference between the actual state and the state predicted by the internal model.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Model-based imitation learning from state trajectories

no code implementations • ICLR 2018 • Subhajit Chaudhury, Daiki Kimura, Tadanobu Inoue, Ryuki Tachibana

We present a model-based imitation learning method that can learn environment-specific optimal actions only from expert state trajectories.

Imitation Learning reinforcement-learning +1

Paper
Add Code

Reward Estimation via State Prediction

no code implementations • ICLR 2018 • Daiki Kimura, Subhajit Chaudhury, Ryuki Tachibana, Sakyasingha Dasgupta

We present a novel reward estimation method that is based on a finite sample of optimal state trajectories from expert demon- strations and can be used for guiding an agent to mimic the expert behavior.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World

no code implementations • 22 Sep 2017 • Tu-Hoa Pham, Giovanni De Magistris, Ryuki Tachibana

While deep reinforcement learning techniques have recently produced considerable achievements on many decision-making problems, their use in robotics has largely been limited to simulated worlds or restricted motions, since unconstrained trial-and-error interactions in the real world can have undesirable consequences for the robot or its environment.

Decision Making reinforcement-learning +1

Paper
Add Code

Deep Reinforcement Learning for High Precision Assembly Tasks

no code implementations • 14 Aug 2017 • Tadanobu Inoue, Giovanni De Magistris, Asim Munawar, Tsuyoshi Yokoya, Ryuki Tachibana

High precision assembly of mechanical parts requires accuracy exceeding the robot precision.

Position reinforcement-learning +2

Paper
Add Code

Conditional generation of multi-modal data using constrained embedding space mapping

no code implementations • 4 Jul 2017 • Subhajit Chaudhury, Sakyasingha Dasgupta, Asim Munawar, Md. A. Salam Khan, Ryuki Tachibana

We present a conditional generative model that maps low-dimensional embeddings of multiple modalities of data to a common latent space hence extracting semantic relationships between them.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.