Acrobot

9 papers with code • 0 benchmarks • 0 datasets

The acrobot is a two-link pendulum with two joints, of which only the joint between the two links is actuated. Initially, both links hang downwards, and the goal is to swing the free end of the lower link up to a given height.
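
As a rough illustration of the goal condition described above, the sketch below computes the height of the free end from the two joint angles and checks it against a target. It rests on several assumptions that are not stated on this page: both links have unit length, the first angle is measured from the downward vertical, the second angle is measured relative to the first link, and the target height is one link length above the fixed pivot. The names `tip_height` and `goal_reached` are hypothetical and are not taken from any of the listed papers.

```python
import math


def tip_height(theta1, theta2, l1=1.0, l2=1.0):
    """Height of the free end relative to the fixed pivot.

    theta1: angle of the upper link from the downward vertical (radians).
    theta2: angle of the lower link relative to the upper link (radians).
    l1, l2: link lengths (assumed to be 1.0 for this sketch).
    """
    return -l1 * math.cos(theta1) - l2 * math.cos(theta1 + theta2)


def goal_reached(theta1, theta2, target_height=1.0):
    """True if the free end is above the (assumed) target height."""
    return tip_height(theta1, theta2) > target_height


# Both links hanging straight down: the tip is two link lengths below the pivot.
print(tip_height(0.0, 0.0))        # -2.0
print(goal_reached(0.0, 0.0))      # False

# Both links pointing straight up: the tip is well above the target.
print(goal_reached(math.pi, 0.0))  # True
```

Because only the joint between the links is actuated, a controller cannot set `theta1` directly. It has to pump energy into the system through the second joint until this condition holds.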

Latest papers with no code

Learning sparse representations in reinforcement learning

no code yet • 4 Sep 2019

This has motivated methods that learn internal representations of the agent's state, effectively reducing the size of the state space and restructuring state representations in order to support generalization.

Deterministic Policy Optimization by Combining Pathwise and Score Function Estimators for Discrete Action Spaces

no code yet • 21 Nov 2017

Our method is applicable to both discrete and continuous action spaces, when competing pathwise methods are limited to the latter.