1 code implementation • 23 Dec 2019 • Tian Tan, Zhihan Xiong, Vikranth R. Dwaracherla
We use an indexed value function to represent uncertainty in our action-value estimates.
no code implementations • 27 Nov 2015 • Vivek S. Borkar, Vikranth R. Dwaracherla, Neeraja Sahasrabudhe
This paper aims at achieving a "good" estimator for the gradient of a function on a high-dimensional space.