PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation

27 Sep 2023  ·  Shizhe Chen, Ricardo Garcia, Cordelia Schmid, Ivan Laptev

The ability of robots to comprehend and execute manipulation tasks from natural language instructions is a long-term goal in robotics. Dominant approaches to language-guided manipulation rely on 2D image representations, which make it difficult to combine multi-view cameras and to infer precise 3D positions and spatial relationships. To address these limitations, we propose PolarNet, a 3D point cloud based policy for language-guided manipulation. It leverages carefully designed point cloud inputs, efficient point cloud encoders, and multimodal transformers to learn 3D point cloud representations and integrate them with language instructions for action prediction. PolarNet proves effective and data-efficient across a variety of experiments on the RLBench benchmark, outperforming state-of-the-art 2D and 3D approaches in both single-task and multi-task learning. It also achieves promising results on a real robot.
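The pipeline the abstract describes (encode a point cloud, fuse with a language embedding, predict an action) can be sketched minimally in NumPy. This is an illustrative stand-in, not the paper's architecture: the per-point MLP with max pooling is a PointNet-style encoder, the concatenation-plus-linear fusion replaces the paper's multimodal transformer, and all weights and dimensions here are arbitrary placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

def encode_point_cloud(points, w1, w2):
    """PointNet-style encoder: per-point MLP, then max-pool to one global feature.

    points: (N, 6) array of xyz coordinates + rgb colors.
    """
    h = np.maximum(points @ w1, 0.0)   # (N, 64) per-point features, ReLU
    h = np.maximum(h @ w2, 0.0)        # (N, 128)
    return h.max(axis=0)               # (128,) order-invariant global feature

def predict_action(pc_feat, lang_feat, w_head):
    """Fuse modalities and regress a gripper action.

    Concatenation + linear head is a simplification of the paper's
    multimodal transformer; the 8-dim output (position, quaternion,
    open/close) is a common end-effector action parameterization.
    """
    fused = np.concatenate([pc_feat, lang_feat])     # (128 + 64,)
    out = fused @ w_head                             # (8,)
    pos, quat, grip = out[:3], out[3:7], out[7]
    quat = quat / np.linalg.norm(quat)               # unit rotation quaternion
    return pos, quat, grip

# Random weights and inputs, just to show the shapes flowing through.
points = rng.normal(size=(1024, 6))      # one observed point cloud
lang_feat = rng.normal(size=64)          # e.g. a pooled instruction embedding
w1 = rng.normal(size=(6, 64))
w2 = rng.normal(size=(64, 128))
w_head = rng.normal(size=(192, 8))

pc_feat = encode_point_cloud(points, w1, w2)
pos, quat, grip = predict_action(pc_feat, lang_feat, w_head)
```

In practice the encoder, fusion module, and action head are trained jointly by behavior cloning on demonstrations; the max pooling makes the point cloud feature invariant to point ordering, which is what motivates set-based encoders over 2D image grids.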


Results from the Paper


| Task               | Dataset | Model    | Metric                                  | Value | Global Rank |
|--------------------|---------|----------|-----------------------------------------|-------|-------------|
| Robot Manipulation | RLBench | PolarNet | Succ. Rate (18 tasks, 100 demos/task)   | 46.4  | #5          |
| Robot Manipulation | RLBench | PolarNet | Succ. Rate (10 tasks, 100 demos/task)   | 89.8  | #1          |
| Robot Manipulation | RLBench | PolarNet | Succ. Rate (74 tasks, 100 demos/task)   | 60.3  | #1          |
| Robot Manipulation | RLBench | PolarNet | Input Image Size                        | 128   | #1          |
