Search Results for author: Bowen Weng

Found 6 papers, 0 papers with code

Momentum Q-learning with Finite-Sample Convergence Guarantee

no code implementations • 30 Jul 2020 • Bowen Weng, Huaqing Xiong, Lin Zhao, Yingbin Liang, Wei Zhang

For the infinite state-action space case, we establish the convergence guarantee for MomentumQ with linear function approximations and Markovian sampling.

Q-Learning

Analysis of Q-learning with Adaptation and Momentum Restart for Gradient Descent

no code implementations • 15 Jul 2020 • Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei Zhang

In this paper, we first characterize the convergence rate for Q-AMSGrad, which is the Q-learning algorithm with the AMSGrad update (a commonly adopted alternative to Adam for theoretical analysis).

Atari Games, Q-Learning

History-Gradient Aided Batch Size Adaptation for Variance Reduced Algorithms

no code implementations • ICML 2020 • Kaiyi Ji, Zhe Wang, Bowen Weng, Yi Zhou, Wei Zhang, Yingbin Liang

In this paper, we propose a novel scheme, which eliminates backtracking line search but still exploits the information along the optimization path by adapting the batch size via history stochastic gradients.

Hybrid Zero Dynamics Inspired Feedback Control Policy Design for 3D Bipedal Locomotion using Reinforcement Learning

no code implementations • 3 Oct 2019 • Guillermo A. Castillo, Bowen Weng, Wei Zhang, Ayonga Hereid

This paper presents a novel model-free reinforcement learning (RL) framework to design feedback control policies for 3D bipedal walking.

Reinforcement Learning (RL)

Can AltQ Learn Faster: Experiments and Theory

no code implementations • 25 Sep 2019 • Bowen Weng, Huaqing Xiong, Yingbin Liang, Wei Zhang

Unlike the popular Deep Q-Network (DQN) learning, Alternating Q-learning (AltQ) does not fully fit a target Q-function at each iteration, and is generally known to be unstable and inefficient.

Atari Games, Q-Learning

Accelerated Target Updates for Q-learning

no code implementations • 7 May 2019 • Bowen Weng, Huaqing Xiong, Wei Zhang

This paper studies accelerations in Q-learning algorithms.

Atari Games, Q-Learning (+2 more)
