OpenAI Gym
163 papers with code • 9 benchmarks • 3 datasets
An open-source toolkit from OpenAI that implements several reinforcement learning benchmark suites, including classic control, Atari, robotics, and MuJoCo tasks.
(Description by Evolutionary learning of interpretable decision trees)
Libraries
Use these libraries to find OpenAI Gym models and implementations.

Latest papers with no code
On Combining Expert Demonstrations in Imitation Learning via Optimal Transport
One of the key approaches to imitation learning (IL) is to define a distance between the agent and the expert and to find an agent policy that minimizes that distance.
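For intuition, such a distance can be as simple as the 1-D Wasserstein (optimal transport) distance between empirical state distributions, which for equal-size samples reduces to the mean absolute difference of the sorted samples. A minimal sketch with hypothetical Gaussian state samples standing in for expert and agent rollouts (the samples and their parameters are illustrative assumptions, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical 1-D state samples from expert and agent rollouts.
expert_states = rng.normal(loc=0.0, scale=1.0, size=500)
agent_states = rng.normal(loc=0.5, scale=1.0, size=500)

# Empirical Wasserstein-1 distance for equal-size 1-D samples:
# mean absolute difference of the sorted samples.
d = np.mean(np.abs(np.sort(expert_states) - np.sort(agent_states)))
```

An imitation-learning objective would then adjust the agent's policy to drive `d` toward zero; practical methods generalize this idea to multi-dimensional state-action distributions.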
Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing
Our research demonstrates that to achieve $\epsilon$-optimal policies for all $M$ tasks, a single agent using DistMT-LSVI needs to run a total number of episodes that is at most $\tilde{\mathcal{O}}({d^3H^6(\epsilon^{-2}+c_{\rm sep}^{-2})}\cdot M/N)$, where $c_{\rm sep}>0$ is a constant representing task separability, $H$ is the horizon of each episode, and $d$ is the feature dimension of the dynamics and rewards.
Learning Environment Models with Continuous Stochastic Dynamics
We aim to provide insights into the decisions faced by the agent by learning an automaton model of the environment's behavior under the agent's control.
Correcting discount-factor mismatch in on-policy policy gradient methods
The policy gradient theorem gives a convenient form of the policy gradient in terms of three factors: an action value, a gradient of the action likelihood, and a state distribution involving discounting called the \emph{discounted stationary distribution}.
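The three factors named above appear explicitly in the standard statement of the policy gradient theorem, which for a policy $\pi_\theta$ can be written as:

```latex
\nabla_\theta J(\theta)
  = \sum_{s} d^{\pi}_{\gamma}(s)
    \sum_{a} q_{\pi}(s, a)\,
    \nabla_\theta \pi(a \mid s, \theta)
```

where $q_\pi(s,a)$ is the action value, $\nabla_\theta \pi(a \mid s, \theta)$ is the gradient of the action likelihood, and $d^{\pi}_{\gamma}$ is the discounted stationary distribution; the mismatch the paper addresses arises when on-policy methods sample states from the undiscounted visitation distribution instead of $d^{\pi}_{\gamma}$.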
Deep Reinforcement Learning for ESG financial portfolio management
This paper investigates the application of Deep Reinforcement Learning (DRL) for Environment, Social, and Governance (ESG) financial portfolio management, with a specific focus on the potential benefits of ESG score-based market regulation.
Mimicking Better by Matching the Approximate Action Distribution
In this paper, we introduce MAAD, a novel, sample-efficient on-policy algorithm for Imitation Learning from Observations.
Active Inference in Hebbian Learning Networks
This work studies how brain-inspired neural ensembles equipped with local Hebbian plasticity can perform active inference (AIF) in order to control dynamical agents.
Discovering Individual Rewards in Collective Behavior through Inverse Multi-Agent Reinforcement Learning
In this study, we tackle this challenge by introducing an off-policy inverse multi-agent reinforcement learning algorithm (IMARL).
Rethinking Population-assisted Off-policy Reinforcement Learning
In this paper, we first analyze the use of off-policy RL algorithms in combination with population-based algorithms, showing that the use of population data could introduce an overlooked error and harm performance.
Gym-preCICE: Reinforcement Learning Environments for Active Flow Control
Active flow control (AFC) involves manipulating fluid flow over time to achieve a desired performance or efficiency.