Search Results for author: Shuze Liu

Found 2 papers, 0 papers with code

The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise

no code implementations15 Jan 2024 Shuze Liu, Shuhang Chen, Shangtong Zhang

Stochastic approximation is a class of algorithms that update a vector iteratively, incrementally, and stochastically, including, e. g., stochastic gradient descent and temporal difference learning.

reinforcement-learning

Improving Monte Carlo Evaluation with Offline Data

no code implementations31 Jan 2023 Shuze Liu, Shangtong Zhang

Most reinforcement learning practitioners evaluate their policies with online Monte Carlo estimators for either hyperparameter tuning or testing different algorithmic design choices, where the policy is repeatedly executed in the environment to get the average outcome.

Management

Cannot find the paper you are looking for? You can Submit a new open access paper.