MuJoCo Games

6 papers with code • 17 benchmarks • 1 dataset

Benchmarks

These leaderboards track progress on MuJoCo Games.

Dataset	Best Model
Ant	IQ-Learn
Walker2d	IQ-Learn
HalfCheetah	POP3D
Hopper	POP3D
InvertedDoublePendulum	POP3D
InvertedPendulum	POP3D
Reacher	POP3D
Swimmer	POP3D
Point Maze	PEMIRL
Sweeper	PEMIRL
Sawyer Pusher	PEMIRL
Humanoid-v2	IQ-Learn
Ant-v3	ParPI
HalfCheetah-v3	ParPI
Hopper-v3	ParPI
Humanoid-v3	ParPI
Walker2d-v3	ParPI

Datasets

MO-Gymnasium

Subtasks

D4RL

Latest papers

LS-IQ: Implicit Reward Regularization for Inverse Reinforcement Learning

robfiras/ls-iq • 1 Mar 2023

Recent methods for imitation learning directly learn a $Q$-function using an implicit reward formulation rather than an explicit reward function.

A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games

deepmind/open_spiel • 12 Jun 2022

This work studies an algorithm, which we call magnetic mirror descent, that is inspired by mirror descent and the non-Euclidean proximal gradient algorithm.

3,989 GitHub stars

EDGE: Explaining Deep Reinforcement Learning Policies

henrygwb/edge • NeurIPS 2021

With the rapid development of deep reinforcement learning (DRL) techniques, there is an increasing need to understand and interpret DRL policies.

01 Dec 2021

IQ-Learn: Inverse soft-Q Learning for Imitation

Div99/IQ-Learn • NeurIPS 2021

In many sequential decision-making problems (e.g., robotics control, game playing, sequential prediction), human or expert data is available containing useful information about the task.

184 GitHub stars

23 Jun 2021

Weak Human Preference Supervision For Deep Reinforcement Learning

kaichiuwong/rlhps • 25 Jul 2020

The current reward learning from human preferences could be used to resolve complex reinforcement learning (RL) tasks without access to a reward function by defining a single fixed preference between pairs of trajectory segments.

RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning

deepmind/deepmind-research • 24 Jun 2020

We hope that our suite of benchmarks will increase the reproducibility of experiments and make it possible to study challenging tasks with a limited computational budget, thus making RL research both more systematic and more accessible across the community.

12,779 GitHub stars
