Badge
Markdown
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-2d-walker)](https://paperswithcode.com/sota/continuous-control-on-2d-walker?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-acrobot)](https://paperswithcode.com/sota/continuous-control-on-acrobot?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-acrobot-limited-sensors)](https://paperswithcode.com/sota/continuous-control-on-acrobot-limited-sensors?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-acrobot-noisy)](https://paperswithcode.com/sota/continuous-control-on-acrobot-noisy?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-acrobot-system)](https://paperswithcode.com/sota/continuous-control-on-acrobot-system?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-ant)](https://paperswithcode.com/sota/continuous-control-on-ant?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-ant-gathering)](https://paperswithcode.com/sota/continuous-control-on-ant-gathering?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-ant-maze)](https://paperswithcode.com/sota/continuous-control-on-ant-maze?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-cart-pole-balancing)](https://paperswithcode.com/sota/continuous-control-on-cart-pole-balancing?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-cart-pole-balancing-1)](https://paperswithcode.com/sota/continuous-control-on-cart-pole-balancing-1?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-cart-pole-balancing-2)](https://paperswithcode.com/sota/continuous-control-on-cart-pole-balancing-2?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-cart-pole-balancing-3)](https://paperswithcode.com/sota/continuous-control-on-cart-pole-balancing-3?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-double-inverted)](https://paperswithcode.com/sota/continuous-control-on-double-inverted?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-full-humanoid)](https://paperswithcode.com/sota/continuous-control-on-full-humanoid?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-half-cheetah)](https://paperswithcode.com/sota/continuous-control-on-half-cheetah?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-hopper)](https://paperswithcode.com/sota/continuous-control-on-hopper?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-inverted-pendulum)](https://paperswithcode.com/sota/continuous-control-on-inverted-pendulum?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-inverted-pendulum-1)](https://paperswithcode.com/sota/continuous-control-on-inverted-pendulum-1?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-inverted-pendulum-noisy)](https://paperswithcode.com/sota/continuous-control-on-inverted-pendulum-noisy?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-inverted-pendulum-2)](https://paperswithcode.com/sota/continuous-control-on-inverted-pendulum-2?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-mountain-car)](https://paperswithcode.com/sota/continuous-control-on-mountain-car?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-mountain-car-limited)](https://paperswithcode.com/sota/continuous-control-on-mountain-car-limited?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-mountain-car-noisy)](https://paperswithcode.com/sota/continuous-control-on-mountain-car-noisy?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-mountain-car-system)](https://paperswithcode.com/sota/continuous-control-on-mountain-car-system?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-simple-humanoid)](https://paperswithcode.com/sota/continuous-control-on-simple-humanoid?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-swimmer)](https://paperswithcode.com/sota/continuous-control-on-swimmer?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-swimmer-gathering)](https://paperswithcode.com/sota/continuous-control-on-swimmer-gathering?p=benchmarking-deep-reinforcement-learning-for)
[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/benchmarking-deep-reinforcement-learning-for/continuous-control-on-swimmer-maze)](https://paperswithcode.com/sota/continuous-control-on-swimmer-maze?p=benchmarking-deep-reinforcement-learning-for)