Methodology

Q-Learning

388 papers with code • 0 benchmarks • 2 datasets

The goal of Q-learning is to learn a policy, which tells an agent what action to take under what circumstances.

( Image credit: Playing Atari with Deep Reinforcement Learning )

Benchmarks

Add a Result

These leaderboards are used to track progress in Q-Learning

No evaluation results yet. Help compare methods by submitting evaluation metrics.

Libraries

Use these libraries to find Q-Learning models and implementations

opendilab/DI-engine

6 papers

2,555

zzmtsvv/rl_task

6 papers

hill-a/stable-baselines

5 papers

4,043

toni-sm/skrl

5 papers

404

See all 29 libraries.

Datasets

Most implemented papers

Most implemented Social Latest No code

Designing Neural Network Architectures using Reinforcement Learning

bowenbaker/metaqnn • • 7 Nov 2016

We introduce MetaQNN, a meta-modeling algorithm based on reinforcement learning to automatically generate high-performing CNN architectures for a given learning task.

Paper
Code

Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning

cts198859/deeprl_dist • • ICML 2017

Many real-world problems, such as network packet routing and urban traffic control, are naturally modeled as multi-agent reinforcement learning (RL) problems.

Paper
Code

Deep Q-learning from Demonstrations

opendilab/DI-engine • • 12 Apr 2017

We present an algorithm, Deep Q-learning from Demonstrations (DQfD), that leverages small sets of demonstration data to massively accelerate the learning process even from relatively small amounts of demonstration data and is able to automatically assess the necessary ratio of demonstration data while learning thanks to a prioritized replay mechanism.

Paper
Code

SQIL: Imitation Learning via Reinforcement Learning with Sparse Rewards

opendilab/DI-engine • • ICLR 2020

Theoretically, we show that SQIL can be interpreted as a regularized variant of BC that uses a sparsity prior to encourage long-horizon imitation.

Paper
Code

QPLEX: Duplex Dueling Multi-Agent Q-Learning

wjh720/QPLEX • • ICLR 2021

This paper presents a novel MARL approach, called duPLEX dueling multi-agent Q-learning (QPLEX), which takes a duplex dueling network architecture to factorize the joint value function.

Paper
Code

IQ-Learn: Inverse soft-Q Learning for Imitation

Div99/IQ-Learn • • NeurIPS 2021

In many sequential decision-making problems (e. g., robotics control, game playing, sequential prediction), human or expert data is available containing useful information about the task.

Paper
Code

Multiagent Cooperation and Competition with Deep Reinforcement Learning

NeuroCSUT/DeepMind-Atari-Deep-Q-Learner-2Player • • 27 Nov 2015

In the present work we extend the Deep Q-Learning Network architecture proposed by Google DeepMind to multiagent environments and investigate how two agents controlled by independent Deep Q-Networks interact in the classic videogame Pong.

Paper
Code

Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning

andyzeng/visual-pushing-grasping • • 27 Mar 2018

Skilled robotic manipulation benefits from complex synergies between non-prehensile (e. g. pushing) and prehensile (e. g. grasping) actions: pushing can help rearrange cluttered objects to make space for arms and fingers; likewise, grasping can help displace objects to make pushing movements more precise and collision-free.

Paper
Code

Benchmarking Batch Deep Reinforcement Learning Algorithms

sfujim/BCQ • • 3 Oct 2019

Widely-used deep reinforcement learning algorithms have been shown to fail in the batch setting--learning from a fixed data set without interaction with the environment.

Paper
Code

Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

oxwhirl/wqmix • • NeurIPS 2020

We show in particular that this projection can fail to recover the optimal policy even with access to $Q^*$, which primarily stems from the equal weighting placed on each joint action.

Paper
Code

Q-Learning

Benchmarks Add a Result

Libraries

Datasets

Most implemented papers

Content

Benchmarks

Add a Result