Search Results for author: Ashutosh Saxena

Found 36 papers, 3 papers with code

MDPs with Unawareness in Robotics

no code implementations20 May 2020 Nan Rong, Joseph Y. Halpern, Ashutosh Saxena

Doing this results in a family of systems, each of which has an extremely large action space, although only a few actions are "interesting".

Decision Making

Learning to Represent Haptic Feedback for Partially-Observable Tasks

no code implementations17 May 2017 Jaeyong Sung, J. Kenneth Salisbury, Ashutosh Saxena

The sense of touch, being the earliest sensory system to develop in a human body [1], plays a critical part of our daily interaction with the environment.

Q-Learning

Learning Transferrable Representations for Unsupervised Domain Adaptation

no code implementations NeurIPS 2016 Ozan Sener, Hyun Oh Song, Ashutosh Saxena, Silvio Savarese

Supervised learning with large scale labelled datasets and deep layered models has caused a paradigm shift in diverse areas in learning and recognition.

Object Recognition Unsupervised Domain Adaptation

Human Centred Object Co-Segmentation

no code implementations12 Jun 2016 Chenxia Wu, Jiemi Zhang, Ashutosh Saxena, Silvio Savarese

Co-segmentation is the automatic extraction of the common semantic regions given a set of images.

Human-Object Interaction Detection Object +1

Unsupervised Semantic Action Discovery from Video Collections

no code implementations11 May 2016 Ozan Sener, Amir Roshan Zamir, Chenxia Wu, Silvio Savarese, Ashutosh Saxena

Our method can also provide a textual description for each of the identified semantic steps and video segments.

Watch-n-Patch: Unsupervised Learning of Actions and Relations

no code implementations11 Mar 2016 Chenxia Wu, Jiemi Zhang, Ozan Sener, Bart Selman, Silvio Savarese, Ashutosh Saxena

For evaluation, we contribute a new challenging RGB-D activity video dataset recorded by the new Kinect v2, which contains several human daily activities as compositions of multiple actions interacting with different objects.

Action Segmentation Clustering

Unsupervised Transductive Domain Adaptation

no code implementations10 Feb 2016 Ozan Sener, Hyun Oh Song, Ashutosh Saxena, Silvio Savarese

We incorporate the domain shift and the transductive target inference into our framework by jointly solving for an asymmetric similarity metric and the optimal transductive target label assignment.

Object Recognition Unsupervised Domain Adaptation

Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding

no code implementations12 Jan 2016 Jaeyong Sung, Seok Hyun Jin, Ian Lenz, Ashutosh Saxena

There is a large variety of objects and appliances in human environments, such as stoves, coffee dispensers, juice extractors, and so on.

Structured Prediction

Brain4Cars: Car That Knows Before You Do via Sensory-Fusion Deep Learning Architecture

no code implementations5 Jan 2016 Ashesh Jain, Hema S. Koppula, Shane Soh, Bharad Raghavan, Avi Singh, Ashutosh Saxena

We introduce a diverse data set with 1180 miles of natural freeway and city driving, and show that we can anticipate maneuvers 3. 5 seconds before they occur in real-time with a precision and recall of 90. 5\% and 87. 4\% respectively.

Learning Preferences for Manipulation Tasks from Online Coactive Feedback

no code implementations5 Jan 2016 Ashesh Jain, Shikhar Sharma, Thorsten Joachims, Ashutosh Saxena

We consider the problem of learning preferences over trajectories for mobile manipulators such as personal robots and assembly line robots.

Watch-Bot: Unsupervised Learning for Reminding Humans of Forgotten Actions

no code implementations14 Dec 2015 Chenxia Wu, Jiemi Zhang, Bart Selman, Silvio Savarese, Ashutosh Saxena

We show that our approach not only improves the unsupervised action segmentation and action cluster assignment performance, but also effectively detects the forgotten actions on a challenging human activity RGB-D video dataset.

Action Segmentation Object

Hierarchical classification of e-commerce related social media

no code implementations26 Nov 2015 Matthew Long, Aditya Jami, Ashutosh Saxena

In this paper, we attempt to classify tweets into root categories of the Amazon browse node hierarchy using a set of tweets with browse node ID labels, a much larger set of tweets without labels, and a set of Amazon reviews.

Classification General Classification +2

Exploring Correlation between Labels to improve Multi-Label Classification

no code implementations25 Nov 2015 Amit Garg, Jonathan Noyola, Romil Verma, Ashutosh Saxena, Aditya Jami

This paper attempts multi-label classification by extending the idea of independent binary classification models for each output label, and exploring how the inherent correlation between output labels can be used to improve predictions.

Binary Classification Classification +3

Structural-RNN: Deep Learning on Spatio-Temporal Graphs

2 code implementations CVPR 2016 Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena

The proposed method is generic and principled as it can be used for transforming any spatio-temporal graph through employing a certain set of well defined steps.

Human Pose Forecasting Skeleton Based Action Recognition

Deep Multimodal Embedding: Manipulating Novel Objects with Point-clouds, Language and Trajectories

no code implementations25 Sep 2015 Jaeyong Sung, Ian Lenz, Ashutosh Saxena

A robot operating in a real-world environment needs to perform reasoning over a variety of sensor modalities such as vision, language and motion trajectories.

Recurrent Neural Networks for Driver Activity Anticipation via Sensory-Fusion Architecture

no code implementations16 Sep 2015 Ashesh Jain, Avi Singh, Hema S. Koppula, Shane Soh, Ashutosh Saxena

We introduce a sensory-fusion architecture which jointly learns to anticipate and fuse information from multiple sensory streams.

Unsupervised Semantic Parsing of Video Collections

no code implementations ICCV 2015 Ozan Sener, Amir Zamir, Silvio Savarese, Ashutosh Saxena

The proposed method is capable of providing a semantic "storyline" of the video composed of its objective steps.

Unsupervised semantic parsing

Watch-n-Patch: Unsupervised Understanding of Actions and Relations

no code implementations CVPR 2015 Chenxia Wu, Jiemi Zhang, Silvio Savarese, Ashutosh Saxena

For evaluation, we also contribute a new challenging RGB-D activity video dataset recorded by the new Kinect v2, which contains several human daily activities as compositions of multiple actions interacted with different objects.

Action Segmentation

Robobarista: Object Part based Transfer of Manipulation Trajectories from Crowd-sourcing in 3D Pointclouds

no code implementations13 Apr 2015 Jaeyong Sung, Seok Hyun Jin, Ashutosh Saxena

We formulate the manipulation planning as a structured prediction problem and design a deep learning model that can handle large noise in the manipulation demonstrations and learns features from three different modalities: point-clouds, language and trajectory.

Structured Prediction

Car that Knows Before You Do: Anticipating Maneuvers via Learning Temporal Driving Models

no code implementations ICCV 2015 Ashesh Jain, Hema S. Koppula, Bharad Raghavan, Shane Soh, Ashutosh Saxena

We evaluate our approach on a diverse data set with 1180 miles of natural freeway and city driving and show that we can anticipate maneuvers 3. 5 seconds before they occur with over 80\% F1-score in real-time.

RoboBrain: Large-Scale Knowledge Engine for Robots

no code implementations1 Dec 2014 Ashutosh Saxena, Ashesh Jain, Ozan Sener, Aditya Jami, Dipendra K. Misra, Hema S. Koppula

In this paper we introduce a knowledge engine, which learns and shares knowledge representations, for robots to carry out a variety of tasks.

MDPs with Unawareness

no code implementations27 Jul 2014 Joseph Y. Halpern, Nan Rong, Ashutosh Saxena

We define a new framework, MDPs with unawareness (MDPUs) to deal with the possibilities that a DM may not be aware of all possible actions.

Decision Making

PlanIt: A Crowdsourcing Approach for Learning to Plan Paths from Large Scale Preference Feedback

no code implementations10 Jun 2014 Ashesh Jain, Debarghya Das, Jayesh K. Gupta, Ashutosh Saxena

We represent trajectory preferences using a cost function that the robot learns and uses it to generate good trajectories in new environments.

Learning Trajectory Preferences for Manipulators via Iterative Improvement

no code implementations NeurIPS 2013 Ashesh Jain, Brian Wojcik, Thorsten Joachims, Ashutosh Saxena

In this paper, we propose a co-active online learning framework for teaching robots the preferences of its users for object manipulation tasks.

Synthesizing Manipulation Sequences for Under-Specified Tasks using Unrolled Markov Random Fields

no code implementations24 Jun 2013 Jaeyong Sung, Bart Selman, Ashutosh Saxena

Many tasks in human environments require performing a sequence of navigation and manipulation steps involving objects.

3D-Based Reasoning with Blocks, Support, and Stability

no code implementations CVPR 2013 Zhaoyin Jia, Andrew Gallagher, Ashutosh Saxena, Tsuhan Chen

Our algorithm incorporates the intuition that a good 3D representation of the scene is the one that fits the data well, and is a stable, self-supporting (i. e., one that does not topple) arrangement of objects.

Object

Hallucinated Humans as the Hidden Context for Labeling 3D Scenes

no code implementations CVPR 2013 Yun Jiang, Hema Koppula, Ashutosh Saxena

Given only a dataset of scenes containing objects but not humans, we show that our algorithm can recover the human object relationships.

Attribute Object +1

Deep Learning for Detecting Robotic Grasps

no code implementations16 Jan 2013 Ian Lenz, Honglak Lee, Ashutosh Saxena

We consider the problem of detecting robotic grasps in an RGB-D view of a scene containing objects.

Robotic Grasping

Learning Human Activities and Object Affordances from RGB-D Videos

no code implementations4 Oct 2012 Hema Swetha Koppula, Rudhir Gupta, Ashutosh Saxena

Given a RGB-D video, we jointly model the human activities and object affordances as a Markov random field where the nodes represent objects and sub-activities, and the edges represent the relationships between object affordances, their relations with sub-activities, and their evolution over time.

Descriptive Object +1

\theta-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding

no code implementations NeurIPS 2011 Cong-Cong Li, Ashutosh Saxena, Tsuhan Chen

For most scene understanding tasks (such as object detection or depth estimation), the classifiers need to consider contextual information in addition to the local features.

Depth Estimation object-detection +2

Semantic Labeling of 3D Point Clouds for Indoor Scenes

no code implementations NeurIPS 2011 Hema S. Koppula, Abhishek Anand, Thorsten Joachims, Ashutosh Saxena

In our experiments over a total of 52 3D scenes of homes and offices (composed from about 550 views, having 2495 segments labeled with 27 object classes), we get a performance of 84. 06% in labeling 17 object classes for offices, and 73. 38% in labeling 17 object classes for home scenes.

Object

Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

no code implementations NeurIPS 2010 Cong-Cong Li, Adarsh Kowdle, Ashutosh Saxena, Tsuhan Chen

In many machine learning domains (such as scene understanding), several related sub-tasks (such as scene categorization, depth estimation, object detection) operate on the same raw data and provide correlated outputs.

Classification Depth Estimation +7

Cascaded Classification Models: Combining Models for Holistic Scene Understanding

no code implementations NeurIPS 2008 Geremy Heitz, Stephen Gould, Ashutosh Saxena, Daphne Koller

We demonstrate the effectiveness of our method on a large set of natural images by combining the subtasks of scene categorization, object detection, multiclass image segmentation, and 3d scene reconstruction.

3D Reconstruction 3D Scene Reconstruction +7

Cannot find the paper you are looking for? You can Submit a new open access paper.