TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
CARLA MAP Leaderboard	CARLA	GRI-based DRL	Driving score	33.785	# 4
CARLA MAP Leaderboard	CARLA	GRI-based DRL	Route completion	57.442	# 5
CARLA MAP Leaderboard	CARLA	GRI-based DRL	Infraction penalty	0.568	# 5
Autonomous Driving	CARLA Leaderboard	GRIAD	Driving Score	36.79	# 10
Autonomous Driving	CARLA Leaderboard	GRIAD	Route Completion	61.85	# 12
Autonomous Driving	CARLA Leaderboard	GRIAD	Infraction penalty	0.6	# 12

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gri-general-reinforced-imitation-and-its/carla-map-leaderboard-on-carla)](https://paperswithcode.com/sota/carla-map-leaderboard-on-carla?p=gri-general-reinforced-imitation-and-its)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/gri-general-reinforced-imitation-and-its/autonomous-driving-on-carla-leaderboard)](https://paperswithcode.com/sota/autonomous-driving-on-carla-leaderboard?p=gri-general-reinforced-imitation-and-its)`

GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

16 Nov 2021 · Raphael Chekroun, Marin Toromanoff, Sascha Hornauer, Fabien Moutarde ·

Deep reinforcement learning (DRL) has been demonstrated to be effective for several complex decision-making applications such as autonomous driving and robotics. However, DRL is notoriously limited by its high sample complexity and its lack of stability. Prior knowledge, e.g. as expert demonstrations, is often available but challenging to leverage to mitigate these issues. In this paper, we propose General Reinforced Imitation (GRI), a novel method which combines benefits from exploration and expert data and is straightforward to implement over any off-policy RL algorithm. We make one simplifying hypothesis: expert demonstrations can be seen as perfect data whose underlying policy gets a constant high reward. Based on this assumption, GRI introduces the notion of offline demonstration agents. This agent sends expert data which are processed both concurrently and indistinguishably with the experiences coming from the online RL exploration agent. We show that our approach enables major improvements on vision-based autonomous driving in urban environments. We further validate the GRI method on Mujoco continuous control tasks with different off-policy RL algorithms. Our method ranked first on the CARLA Leaderboard and outperforms World on Rails, the previous state-of-the-art, by 17%.

PDF Abstract

Code

Add Remove Mark official

No code implementations yet. Submit your code now

Tasks

Add Remove

Autonomous Driving

CARLA MAP Leaderboard

Continuous Control

Decision Making

Datasets

MuJoCo

CARLA

Results from the Paper

Edit

Ranked #4 on CARLA MAP Leaderboard on CARLA

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
CARLA MAP Leaderboard	CARLA	GRI-based DRL	Driving score	33.785	# 4	Compare
			Route completion	57.442	# 5	Compare
			Infraction penalty	0.568	# 5	Compare
Autonomous Driving	CARLA Leaderboard	GRIAD	Driving Score	36.79	# 10	Compare
			Route Completion	61.85	# 12	Compare
			Infraction penalty	0.6	# 12	Compare

Methods

Add Remove

CARLA • Entropy Regularization • PPO

Edit Social Preview

GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove