no code implementations • 20 May 2023 • Naman Saxena, Subhojyoti Khastigir, Shishir Kolathaya, Shalabh Bhatnagar
In this work, we present both on-policy and off-policy deterministic policy gradient theorems for the average reward performance criterion.
no code implementations • 22 Aug 2022 • Eshwar S R, Shishir Kolathaya, Gugan Thoppe
This leads to a lot of wasteful interactions since, once the ranking is done, only the data associated with the top-ranked policies is used for subsequent learning.
no code implementations • 4 Apr 2021 • Lokesh Krishna, Utkarsh A. Mishra, Guillermo A. Castillo, Ayonga Hereid, Shishir Kolathaya
In this paper, with a view toward deployment of light-weight control frameworks for bipedal walking robots, we realize end-foot trajectories that are shaped by a single linear feedback policy.
no code implementations • 26 Dec 2020 • Sagar Gubbi, Shishir Kolathaya, Bharadwaj Amrutur
Industrial robot manipulators are not able to match the precision and speed with which humans are able to execute contact rich tasks even to this day.
no code implementations • 26 Dec 2020 • Sagar Gubbi Venkatesh, Raviteja Upadrashta, Shishir Kolathaya, Bharadwaj Amrutur
Existing architectures for imitation learning using image-to-action policy networks perform poorly when presented with an input image containing multiple instances of the object of interest, especially when the number of expert demonstrations available for training are limited.
no code implementations • 26 Dec 2020 • Sagar Gubbi Venkatesh, Nihesh Rathod, Shishir Kolathaya, Bharadwaj Amrutur
Imitation learning is a data-driven approach to acquiring skills that relies on expert demonstrations to learn a policy that maps observations to actions.
no code implementations • 25 Dec 2020 • Sagar Gubbi Venkatesh, Raviteja Upadrashta, Shishir Kolathaya, Bharadwaj Amrutur
Robots that must operate in novel environments and collaborate with humans must be capable of acquiring new knowledge from human experts during operation.
1 code implementation • 30 Oct 2020 • Kartik Paigwar, Lokesh Krishna, Sashank Tirumala, Naman Khetan, Aditya Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya
In particular, the parameters of the end-foot trajectories are shaped via a linear feedback policy that takes the torso orientation and the terrain slope as inputs.
no code implementations • 28 Jul 2020 • Sashank Tirumala, Sagar Gubbi, Kartik Paigwar, Aditya Sagi, Ashish Joglekar, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya
First, multiple simpler policies are trained to generate trajectories for a discrete set of target velocities and turning radius.
no code implementations • 15 May 2019 • Shounak Bhattacharya, Abhik Singla, Abhimanyu, Dhaivat Dholakiya, Shalabh Bhatnagar, Bharadwaj Amrutur, Ashitava Ghosal, Shishir Kolathaya
In this work, we provide a simulation framework to perform systematic studies on the effects of spinal joint compliance and actuation on bounding performance of a 16-DOF quadruped spined robot Stoch 2.
no code implementations • 9 Oct 2018 • Abhik Singla, Shounak Bhattacharya, Dhaivat Dholakiya, Shalabh Bhatnagar, Ashitava Ghosal, Bharadwaj Amrutur, Shishir Kolathaya
Leveraging on this underlying structure, we then realize walking in Stoch by a straightforward reconstruction of joint trajectories from kMPs.