Acrobot

9 papers with code • 0 benchmarks • 0 datasets

The acrobot system includes two joints and two links, where the joint between the two links is actuated. Initially, the links are hanging downwards, and the goal is to swing the end of the lower link up to a given height.
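
The goal condition described above can be sketched as a simple height check on the free end of the lower link. This is a minimal illustration, not a reference implementation: the unit link lengths, the target height of 1.0, the angle convention (both angles zero when the links hang straight down), and the names `tip_height` and `reached_goal` are all assumptions introduced here, since the description does not specify them.

```python
import math

# Assumed link lengths (hypothetical unit values; the description gives none).
LINK_1 = 1.0
LINK_2 = 1.0
# Assumed target height above the fixed pivot (hypothetical).
TARGET_HEIGHT = 1.0


def tip_height(theta1: float, theta2: float) -> float:
    """Height of the free end of the lower link, relative to the fixed pivot.

    theta1: angle of the upper link, measured from straight down.
    theta2: angle of the lower link, measured relative to the upper link.
    """
    return -LINK_1 * math.cos(theta1) - LINK_2 * math.cos(theta1 + theta2)


def reached_goal(theta1: float, theta2: float, target: float = TARGET_HEIGHT) -> bool:
    """True once the free end has swung above the target height."""
    return tip_height(theta1, theta2) > target


# Initial configuration: both links hanging straight down.
print(tip_height(0.0, 0.0), reached_goal(0.0, 0.0))
# Fully inverted configuration: both links pointing straight up.
print(tip_height(math.pi, 0.0), reached_goal(math.pi, 0.0))
```

Under these assumptions the hanging start sits two link lengths below the pivot, so the system has to pump energy through the actuated middle joint before the check can succeed.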

Benchmarks

These leaderboards are used to track progress in Acrobot.

No evaluation results yet.

Latest papers with no code

Learning sparse representations in reinforcement learning

no code yet • 4 Sep 2019

This has motivated methods that learn internal representations of the agent's state, effectively reducing the size of the state space and restructuring state representations in order to support generalization.

Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces

no code yet • 21 Nov 2017

Our method is applicable to both discrete and continuous action spaces, whereas competing pathwise methods are limited to the latter.