Search Results for author: Shishir Kolathaya

Found 11 papers, 1 papers with code

Off-Policy Average Reward Actor-Critic with Deterministic Policy Search

no code implementations20 May 2023 Naman Saxena, Subhojyoti Khastigir, Shishir Kolathaya, Shalabh Bhatnagar

In this work, we present both on-policy and off-policy deterministic policy gradient theorems for the average reward performance criterion.

Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking

no code implementations22 Aug 2022 Eshwar S R, Shishir Kolathaya, Gugan Thoppe

This leads to a lot of wasteful interactions since, once the ranking is done, only the data associated with the top-ranked policies is used for subsequent learning.

Reinforcement Learning (RL)

Learning Linear Policies for Robust Bipedal Locomotion on Terrains with Varying Slopes

no code implementations4 Apr 2021 Lokesh Krishna, Utkarsh A. Mishra, Guillermo A. Castillo, Ayonga Hereid, Shishir Kolathaya

In this paper, with a view toward deployment of light-weight control frameworks for bipedal walking robots, we realize end-foot trajectories that are shaped by a single linear feedback policy.

Imitation Learning for High Precision Peg-in-Hole Tasks

no code implementations26 Dec 2020 Sagar Gubbi, Shishir Kolathaya, Bharadwaj Amrutur

Industrial robot manipulators are not able to match the precision and speed with which humans are able to execute contact rich tasks even to this day.

Imitation Learning Vocal Bursts Intensity Prediction

Multi-Instance Aware Localization for End-to-End Imitation Learning

no code implementations26 Dec 2020 Sagar Gubbi Venkatesh, Raviteja Upadrashta, Shishir Kolathaya, Bharadwaj Amrutur

Existing architectures for imitation learning using image-to-action policy networks perform poorly when presented with an input image containing multiple instances of the object of interest, especially when the number of expert demonstrations available for training are limited.

Imitation Learning

Stochastic Action Prediction for Imitation Learning

no code implementations26 Dec 2020 Sagar Gubbi Venkatesh, Nihesh Rathod, Shishir Kolathaya, Bharadwaj Amrutur

Imitation learning is a data-driven approach to acquiring skills that relies on expert demonstrations to learn a policy that maps observations to actions.

Action Generation Imitation Learning

Teaching Robots Novel Objects by Pointing at Them

no code implementations25 Dec 2020 Sagar Gubbi Venkatesh, Raviteja Upadrashta, Shishir Kolathaya, Bharadwaj Amrutur

Robots that must operate in novel environments and collaborate with humans must be capable of acquiring new knowledge from human experts during operation.

Object

Robust Quadrupedal Locomotion on Sloped Terrains: A Linear Policy Approach

1 code implementation30 Oct 2020 Kartik Paigwar, Lokesh Krishna, Sashank Tirumala, Naman Khetan, Aditya Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

In particular, the parameters of the end-foot trajectories are shaped via a linear feedback policy that takes the torso orientation and the terrain slope as inputs.

Learning Stable Manoeuvres in Quadruped Robots from Expert Demonstrations

no code implementations28 Jul 2020 Sashank Tirumala, Sagar Gubbi, Kartik Paigwar, Aditya Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

First, multiple simpler policies are trained to generate trajectories for a discrete set of target velocities and turning radius.

Learning Active Spine Behaviors for Dynamic and Efficient Locomotion in Quadruped Robots

no code implementations15 May 2019 Shounak Bhattacharya, Abhik Singla, Abhimanyu, Dhaivat Dholakiya, Shalabh Bhatnagar, Bharadwaj Amrutur, Ashitava Ghosal, Shishir Kolathaya

In this work, we provide a simulation framework to perform systematic studies on the effects of spinal joint compliance and actuation on bounding performance of a 16-DOF quadruped spined robot Stoch 2.

Realizing Learned Quadruped Locomotion Behaviors through Kinematic Motion Primitives

no code implementations9 Oct 2018 Abhik Singla, Shounak Bhattacharya, Dhaivat Dholakiya, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya

Leveraging on this underlying structure, we then realize walking in Stoch by a straightforward reconstruction of joint trajectories from kMPs.

Cannot find the paper you are looking for? You can Submit a new open access paper.