Search Results for author: Siddarth Jain

Found 5 papers, 0 papers with code

Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos

no code implementations27 Jun 2023 Chiori Hori, Puyuan Peng, David Harwath, Xinyu Liu, Kei Ota, Siddarth Jain, Radu Corcodel, Devesh Jha, Diego Romeres, Jonathan Le Roux

This paper introduces a method for robot action sequence generation from instruction videos using (1) an audio-visual Transformer that converts audio-visual features and instruction speech to a sequence of robot actions called dynamic movement primitives (DMPs) and (2) style-transfer-based training that employs multi-task learning with video captioning and weakly-supervised learning with a semantic classifier to exploit unpaired video-action data.

Multi-Task Learning Scene Understanding +3

Generalizable Human-Robot Collaborative Assembly Using Imitation Learning and Force Control

no code implementations2 Dec 2022 Devesh K. Jha, Siddarth Jain, Diego Romeres, William Yerazunis, Daniel Nikovski

In this paper, we present a system for human-robot collaborative assembly using learning from demonstration and pose estimation, so that the robot can adapt to the uncertainty caused by the operation of humans.

Imitation Learning Pose Estimation

Learning to Synthesize Volumetric Meshes from Vision-based Tactile Imprints

no code implementations29 Mar 2022 Xinghao Zhu, Siddarth Jain, Masayoshi Tomizuka, Jeroen van Baar

Vision-based tactile sensors typically utilize a deformable elastomer and a camera mounted above to provide high-resolution image observations of contacts.

Image Augmentation Robotic Grasping

InSeGAN: A Generative Approach to Segmenting Identical Instances in Depth Images

no code implementations ICCV 2021 Anoop Cherian, Goncalo Dias Pais, Siddarth Jain, Tim K. Marks, Alan Sullivan

To use our model for instance segmentation, we propose an instance pose encoder that learns to take in a generated depth image and reproduce the pose code vectors for all of the object instances.

Generative Adversarial Network Instance Segmentation +2

Cannot find the paper you are looking for? You can Submit a new open access paper.