General Reinforcement Learning

35 papers with code • 6 benchmarks • 7 datasets

Benchmarks

These leaderboards are used to track progress in General Reinforcement Learning.

Dataset	Best Model
Obstacle Tower (No Gen) fixed	RNB
Obstacle Tower (No Gen) varied	RNB
Obstacle Tower (Weak Gen) fixed	PPO
Obstacle Tower (Weak Gen) varied	RNB
Obstacle Tower (Strong Gen) fixed	PPO
Obstacle Tower (Strong Gen) varied	RNB

Libraries

Use these libraries to find General Reinforcement Learning models and implementations.

opendilab/DI-engine

2 papers

2,539

aslanides/aixijs

2 papers

148

atavakol/action-hypergraph-networks

2 papers

Latest papers

End-to-End Egospheric Spatial Memory

ivy-dl/memory • 15 Feb 2021

Spatial memory, or the ability to remember and recall specific locations and objects, is central to autonomous agents' ability to carry out tasks in real environments.

Interactive Learning from Activity Description

khanhptnk/iliad • 13 Feb 2021

We present a novel interactive learning protocol that enables training request-fulfilling agents by verbally describing their activities.

Learning to Represent Action Values as a Hypergraph on the Action Vertices

atavakol/action-hypergraph-networks • ICLR 2021

To test this, we set forth the action hypergraph networks framework -- a class of functions for learning action representations in multi-dimensional discrete action spaces with a structural inductive bias.

28 Oct 2020

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

ml-jku/align-rudder • 29 Sep 2020

For such complex tasks, the recently proposed RUDDER uses reward redistribution to leverage steps in the Q-function that are associated with accomplishing sub-tasks.
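
The reward redistribution idea mentioned above can be illustrated with a minimal sketch. This is not the Align-RUDDER or RUDDER code. It assumes a return predictor is already available and has produced one predicted final return per time step (the hypothetical `return_predictions` list), and it assigns each step the change in that prediction. Placing the remaining prediction error on the last step is a simplification chosen here so that the redistributed rewards sum to the true episode return.

```python
def redistribute_reward(return_predictions, episode_return):
    """Turn a delayed episode return into per-step rewards.

    return_predictions[t] is an assumed, externally supplied prediction
    of the final return after observing the episode up to step t.
    """
    redistributed = []
    previous = 0.0
    for prediction in return_predictions:
        # A step is rewarded by how much it changed the predicted return.
        redistributed.append(prediction - previous)
        previous = prediction
    # Simplification: put the leftover prediction error on the final step
    # so the total reward of the episode is unchanged.
    redistributed[-1] += episode_return - previous
    return redistributed


if __name__ == "__main__":
    # The jump in prediction at step 2 marks the step that receives the reward.
    print(redistribute_reward([0.0, 0.0, 1.0, 1.0], 1.0))
```

In this toy episode the prediction rises at the third step, so that step receives the reward that the environment originally delivered only at the end.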

Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors

adipandas/gym_multirotor • 15 Jul 2020

The results show faster learning with the presented approach than when learning the control policy from scratch for this new UAV design, which modifies a conventional quadcopter by adding more degrees of freedom (from 4 actuators in a conventional quadcopter to 8 actuators in the tilt-rotor quadcopter).

Data-Efficient Reinforcement Learning with Self-Predictive Representations

mila-iqia/spr • ICLR 2021

We further improve performance by adding data augmentation to the future prediction loss, which forces the agent's representations to be consistent across multiple views of an observation.

156

12 Jul 2020
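
The consistency requirement described in the abstract excerpt above can be sketched as a loss between the representations of two views of the same observation. This is an illustrative sketch, not the code from the listed repository: the cosine-based form and the function name `cosine_consistency_loss` are assumptions made here, and the representation vectors are assumed to come from some encoder that is not shown.

```python
import math


def cosine_consistency_loss(view_a, view_b):
    """Return 1 - cosine similarity of two representation vectors.

    The loss is 0 when the two views map to the same direction and
    grows up to 2 as the representations point in opposite directions.
    """
    dot = sum(a * b for a, b in zip(view_a, view_b))
    norm_a = math.sqrt(sum(a * a for a in view_a))
    norm_b = math.sqrt(sum(b * b for b in view_b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("representations must be non-zero vectors")
    return 1.0 - dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Hypothetical encoder outputs for two augmented views of one observation.
    print(cosine_consistency_loss([1.0, 0.0], [0.0, 1.0]))
```

Minimizing such a loss pushes the encoder to produce similar representations for differently augmented views, which is one way to express the consistency the excerpt refers to.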

The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning

chandar-lab/LoCA • NeurIPS 2020

For example, the common single-task sample-efficiency metric conflates improvements due to model-based learning with various other aspects, such as representation learning, making it difficult to assess true progress on model-based RL.

07 Jul 2020

Counterfactual Data Augmentation using Locally Factored Dynamics

spitis/mrl • NeurIPS 2020

Many dynamic processes, including common scenarios in robotic control and reinforcement learning (RL), involve a set of interacting subprocesses.

103

06 Jul 2020

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

opendilab/DI-engine • ICML 2020

In this work we aim to solve this problem by optimizing the efficiency and resource utilization of reinforcement learning algorithms instead of relying on distributed computation.

2,539

21 Jun 2020

Learning to Incentivize Other Learning Agents

011235813/lio • NeurIPS 2020

The challenge of developing powerful and general Reinforcement Learning (RL) agents has received increasing attention in recent years.

10 Jun 2020
