Search Results for author: Prabhat Nagarajan

Found 8 papers, 4 papers with code

Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators

no code implementations16 Jul 2020 Yasuhiro Fujita, Kota Uenishi, Avinash Ummadisingu, Prabhat Nagarajan, Shimpei Masuda, Mario Ynocente Castro

Developing personal robots that can perform a diverse range of manipulation tasks in unstructured environments necessitates solving several challenges for robotic grasping systems.

reinforcement-learning Reinforcement Learning (RL) +1

Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning

1 code implementation1 Feb 2020 Zhang-Wei Hong, Prabhat Nagarajan, Guilherme Maeda

PIEKD is a learning framework that uses an ensemble of policies to act in the environment while periodically sharing knowledge amongst policies in the ensemble through knowledge distillation.

Knowledge Distillation reinforcement-learning +1

ChainerRL: A Deep Reinforcement Learning Library

1 code implementation9 Dec 2019 Yasuhiro Fujita, Prabhat Nagarajan, Toshiki Kataoka, Takahiro Ishikawa

In this paper, we introduce ChainerRL, an open-source deep reinforcement learning (DRL) library built using Python and the Chainer deep learning framework.

reinforcement-learning Reinforcement Learning (RL)

Learning Latent State Spaces for Planning through Reward Prediction

no code implementations9 Dec 2019 Aaron Havens, Yi Ouyang, Prabhat Nagarajan, Yasuhiro Fujita

The latent representation is learned exclusively from multi-step reward prediction which we show to be the only necessary information for successful planning.

Model-based Reinforcement Learning reinforcement-learning +1

Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observations

3 code implementations12 Apr 2019 Daniel S. Brown, Wonjoon Goo, Prabhat Nagarajan, Scott Niekum

A critical flaw of existing inverse reinforcement learning (IRL) methods is their inability to significantly outperform the demonstrator.

Imitation Learning reinforcement-learning +1

Deterministic Implementations for Reproducibility in Deep Reinforcement Learning

1 code implementation15 Sep 2018 Prabhat Nagarajan, Garrett Warnell, Peter Stone

One by one, we then allow individual sources of nondeterminism to affect our otherwise deterministic implementation, and measure the impact of each source on the variance in performance.

Q-Learning reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.