Efficient Exploration

144 papers with code • 0 benchmarks • 2 datasets

Efficient Exploration is one of the main obstacles in scaling up modern deep reinforcement learning algorithms. The main challenge in Efficient Exploration is the balance between exploiting current estimates, and gaining information about poorly understood states and actions.

Source: Randomized Value Functions via Multiplicative Normalizing Flows

Libraries

Use these libraries to find Efficient Exploration models and implementations
2 papers
25

Most implemented papers

Self-Supervised Exploration via Disagreement

pathak22/exploration-by-disagreement 10 Jun 2019

In this paper, we propose a formulation for exploration inspired by the work in active learning literature.

Hybrid Genetic Search for the CVRP: Open-Source Implementation and SWAP* Neighborhood

vidalt/HGS-CVRP 23 Nov 2020

The vehicle routing problem is one of the most studied combinatorial optimization topics, due to its practical importance and methodological interest.

BeBold: Exploration Beyond the Boundary of Explored Regions

facebookresearch/nle 15 Dec 2020

In this paper, we analyze the pros and cons of each method and propose the regulated difference of inverse visitation counts as a simple but effective criterion for IR.

State Entropy Maximization with Random Encoders for Efficient Exploration

ray-project/ray ICLR Workshop SSL-RL 2021

Recent exploration methods have proven to be a recipe for improving sample-efficiency in deep reinforcement learning (RL).

Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration

hyunghona/emu NeurIPS 2021

Efficient exploration in deep cooperative multi-agent reinforcement learning (MARL) still remains challenging in complex coordination problems.

Online Decision Transformer

facebookresearch/online-dt 11 Feb 2022

Recent work has shown that offline reinforcement learning (RL) can be formulated as a sequence modeling problem (Chen et al., 2021; Janner et al., 2021) and solved via approaches similar to large-scale language modeling.

Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot

med-air/dex 20 Feb 2023

Task automation of surgical robot has the potentials to improve surgical efficiency.

Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study

baai-agents/cradle 5 Mar 2024

Despite the success in specific tasks and scenarios, existing foundation agents, empowered by large models (LMs) and advanced tools, still cannot generalize to different scenarios, mainly due to dramatic differences in the observations and actions across scenarios.

Generalization and Exploration via Randomized Value Functions

qdevpsi3/randomized-value-iteration 4 Feb 2014

We propose randomized least-squares value iteration (RLSVI) -- a new reinforcement learning algorithm designed to explore and generalize efficiently via linearly parameterized value functions.

Batch Bayesian Optimization via Local Penalization

SheffieldML/GPyOpt 29 May 2015

The approach assumes that the function of interest, $f$, is a Lipschitz continuous function.