Randomized Ensembled Double QLearning: Learning Fast Without a Model
Xinyue Chen
•
Che Wang
•
Zijian Zhou
•
Keith Ross

20210115

On the Estimation Bias in Double QLearning
Anonymous

20210101

Randomized Ensembled Double QLearning: Learning Fast Without a Model
Anonymous

20210101

Weighted Bellman Backups for Improved SignaltoNoise in QUpdates
Anonymous

20210101

Double Qlearning: New Analysis and Sharper Finitetime Bound
Anonymous

20210101

Resolving Implicit Coordination in MultiAgent Deep Reinforcement Learning with Deep QNetworks & Game Theory
Griffin Adams
•
Sarguna Janani Padmanabhan
•
Shivang Shekhar

20201208

Selfcorrecting QLearning
Rong Zhu
•
Mattia Rigotti

20201202

The MeanSquared Error of Double QLearning

Wentao Weng
•
Harsh Gupta
•
Niao He
•
Lei Ying
•
R. Srikant

20201201

Energy and Servicepriority aware Trajectory Design for UAVBSs using Double QLearning
Sayed Amir Hoseini
•
Ayub Bokani
•
Jahan Hassan
•
Shavbo Salehi
•
Salil S. Kanhere

20201026

FiniteTime Analysis for Double Qlearning
Huaqing Xiong
•
Lin Zhao
•
Yingbin Liang
•
Wei zhang

20200929

A New Approach for Tactical Decision Making in Lane Changing: Sample Efficient Deep Q Learning with a Safety Feedback Reward
M. Ugur Yavas
•
N. Kemal Ure
•
Tufan Kumbasar

20200924

Reinforcement Learning with Quantum Variational Circuits
Owen Lockwood
•
Mei Si

20200815

Chrome Dino Run using Reinforcement Learning
Divyanshu Marwah
•
Sneha Srivastava
•
Anusha Gupta
•
Shruti Verma

20200815

QPLEX: Duplex Dueling MultiAgent QLearning
Jianhao Wang
•
Zhizhou Ren
•
Terry Liu
•
Yang Yu
•
Chongjie Zhang

20200803

SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning

Kimin Lee
•
Michael Laskin
•
Aravind Srinivas
•
Pieter Abbeel

20200709

ProvablyEfficient Double QLearning
Wentao Weng
•
Harsh Gupta
•
Niao He
•
Lei Ying
•
R. Srikant

20200709

Regularly Updated Deterministic Policy Gradient Algorithm
Shuai Han
•
Wenbo Zhou
•
Shuai Lü
•
Jiayu Yu

20200701

Noise, overestimation and exploration in Deep Reinforcement Learning
Rafael Stekolshchik

20200625

Deep Reinforcement Learning Control for Radar Detection and Tracking in Congested Spectral Environments
Charles E. Thornton
•
Mark A. Kozy
•
R. Michael Buehrer
•
Anthony F. Martone
•
Kelly D. Sherbondy

20200623

Parameterized MDPs and Reinforcement Learning Problems  A Maximum Entropy Principle Based Framework
Amber Srivastava
•
Srinivasa M Salapaka

20200617

Decorrelated Double Qlearning
Gang Chen

20200612

Balancing a CartPole System with Reinforcement Learning  A Tutorial
Swagat Kumar

20200608

Acme: A Research Framework for Distributed Reinforcement Learning

Matt Hoffman
•
Bobak Shahriari
•
John Aslanides
•
Gabriel BarthMaron
•
Feryal Behbahani
•
Tamara Norman
•
Abbas Abdolmaleki
•
Albin Cassirer
•
Fan Yang
•
Kate Baumli
•
Sarah Henderson
•
Alex Novikov
•
Sergio Gómez Colmenarejo
•
Serkan Cabi
•
Caglar Gulcehre
•
Tom Le Paine
•
Andrew Cowie
•
Ziyu Wang
•
Bilal Piot
•
Nando de Freitas

20200601

Basal Glucose Control in Type 1 Diabetes using Deep Reinforcement Learning: An In Silico Validation
Taiyu Zhu
•
Kezhi Li
•
Pau Herrero
•
Pantelis Georgiou

20200518

A Double QLearning Approach for Navigation of Aerial Vehicles with Connectivity Constraint
Behzad Khamidehi
•
Elvino S. Sousa

20200224

Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
Yuanyi Zhong
•
Alexander Schwing
•
Jian Peng

20200221

Fast Reinforcement Learning for Antijamming Communications
PeiGen Ye
•
YuanGen Wang
•
Jin Li
•
Liang Xiao

20200213

$γ$Regret for NonEpisodic Reinforcement Learning
Shuang Liu
•
Hao Su

20200212

Dynamically Balanced Value Estimates for ActorCritic Methods
Anonymous

20200101

Do recent advancements in modelbased deep reinforcement learning really improve data efficiency?
Anonymous

20200101

SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement Learning

Keng Wah Loon
•
Laura Graesser
•
Milan Cvitkovic

20191228

Exploiting the potential of deep reinforcement learning for classification tasks in highdimensional and unstructured data
Johan S. ObandoCeron
•
Victor Romero Cano
•
Walter Mayor Toro

20191220

TaskOriented Language Grounding for Language Input with Multiple SubGoals of NonLinear Order

Vladislav Kurenkov
•
Bulat Maksudov
•
Adil Khan

20191027

Reverse Experience Replay
Egor Rotinov

20191019

To Combine or Not To Combine? A Rainbow Deep Reinforcement Learning Agent for Dialog Policies
Dirk V{\"a}th
•
Ngoc Thang Vu

20190901

Performing Deep Recurrent Double QLearning for Atari Games
Felipe MorenoVera

20190816

Largescale Traffic Signal Control Using a Novel MultiAgent Reinforcement Learning
Xiaoqiang Wang
•
Liangjun Ke
•
Zhimin Qiao
•
Xinghua Chai

20190810

A Story of Two Streams: Reinforcement Learning Models from Human Behavior and Neuropsychiatry

Baihan Lin
•
Guillermo Cecchi
•
Djallel Bouneffouf
•
Jenna Reinen
•
Irina Rish

20190621

Generative Adversarial Imagination for Sample Efficient Deep Reinforcement Learning
Kacper Kielak

20190430

Double Deep QLearning for Optimal Execution
Brian Ning
•
Franco Ho Ting Lin
•
Sebastian Jaimungal

20181217

Revisiting the Softmax Bellman Operator: New Benefits and New Perspective

Zhao Song
•
Ronald E. Parr
•
Lawrence Carin

20181202

Macro action selection with deep reinforcement learning in StarCraft

Sijia Xu
•
Hongyu Kuang
•
Zhi Zhuang
•
Renjie Hu
•
Yang Liu
•
Huyang Sun

20181202

Distributed Prioritized Experience Replay

Dan Horgan
•
John Quan
•
David Budden
•
Gabriel BarthMaron
•
Matteo Hessel
•
Hado van Hasselt
•
David Silver

20180302

Addressing Function Approximation Error in ActorCritic Methods

Scott Fujimoto
•
Herke van Hoof
•
David Meger

20180226

Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments
Yan Zheng
•
Jianye Hao
•
Zongzhang Zhang

20180223

Efficient Exploration through Bayesian Deep QNetworks

Kamyar Azizzadenesheli
•
Animashree Anandkumar

20180213

Faster Deep Qlearning using Neural Episodic Control
Daichi Nishio
•
Satoshi Yamane

20180106

Rainbow: Combining Improvements in Deep Reinforcement Learning

Matteo Hessel
•
Joseph Modayil
•
Hado van Hasselt
•
Tom Schaul
•
Georg Ostrovski
•
Will Dabney
•
Dan Horgan
•
Bilal Piot
•
Mohammad Azar
•
David Silver

20171006

Noisy Networks for Exploration

Meire Fortunato
•
Mohammad Gheshlaghi Azar
•
Bilal Piot
•
Jacob Menick
•
Ian Osband
•
Alex Graves
•
Vlad Mnih
•
Remi Munos
•
Demis Hassabis
•
Olivier Pietquin
•
Charles Blundell
•
Shane Legg

20170630

Sample Efficient ActorCritic with Experience Replay

Ziyu Wang
•
Victor Bapst
•
Nicolas Heess
•
Volodymyr Mnih
•
Remi Munos
•
Koray Kavukcuoglu
•
Nando de Freitas

20161103

Dynamic Frame skip Deep Q Network
Aravind S. Lakshminarayanan
•
Sahil Sharma
•
Balaraman Ravindran

20160517

Dueling Network Architectures for Deep Reinforcement Learning

Ziyu Wang
•
Tom Schaul
•
Matteo Hessel
•
Hado van Hasselt
•
Marc Lanctot
•
Nando de Freitas

20151120

Deep Reinforcement Learning with Double Qlearning

Hado van Hasselt
•
Arthur Guez
•
David Silver

20150922

Double Qlearning
Hado V. Hasselt

20101201
