Search Results for author: Nur Muhammad Mahi Shafiullah

Found 6 papers, 5 papers with code

Behavior Generation with Latent Actions

1 code implementation • 5 Mar 2024 • Seungjae Lee, Yibin Wang, Haritheja Etukuru, H. Jin Kim, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

Unlike language or image generation, decision making requires modeling actions - continuous-valued vectors that are multimodal in their distribution, potentially drawn from uncurated sources, where generation errors can compound in sequential prediction.

Autonomous Driving Decision Making +2

OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics

1 code implementation • 22 Jan 2024 • Peiqi Liu, Yaswanth Orru, Jay Vakil, Chris Paxton, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

The results demonstrate that OK-Robot achieves a 58.5% success rate in open-ended pick-and-drop tasks, representing a new state-of-the-art in Open Vocabulary Mobile Manipulation (OVMM) with nearly 1.8x the performance of prior work.

Object Detection

On Bringing Robots Home

1 code implementation • 27 Nov 2023 • Nur Muhammad Mahi Shafiullah, Anant Rai, Haritheja Etukuru, Yiqian Liu, Ishan Misra, Soumith Chintala, Lerrel Pinto

We use the Stick to collect 13 hours of data in 22 homes of New York City, and train Home Pretrained Representations (HPR).

From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data

no code implementations • 18 Oct 2022 • Zichen Jeff Cui, Yibin Wang, Nur Muhammad Mahi Shafiullah, Lerrel Pinto

While large-scale sequence modeling from offline data has led to impressive performance gains in natural language and image generation, directly translating such ideas to robotics has been challenging.

Image Generation

CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory

2 code implementations • 11 Oct 2022 • Nur Muhammad Mahi Shafiullah, Chris Paxton, Lerrel Pinto, Soumith Chintala, Arthur Szlam

We propose CLIP-Fields, an implicit scene model that can be used for a variety of tasks, such as segmentation, instance identification, semantic search over space, and view localization.

Segmentation Semantic Segmentation +1

Behavior Transformers: Cloning $k$ modes with one stone

2 code implementations • 22 Jun 2022 • Nur Muhammad Mahi Shafiullah, Zichen Jeff Cui, Ariuntuya Altanzaya, Lerrel Pinto

In this work, we present Behavior Transformer (BeT), a new technique to model unlabeled demonstration data with multiple modes.

Object Detection Offline RL
