Search Results for author: Avinash Mohan

Found 4 papers, 0 papers with code

Actor-Critic based Improper Reinforcement Learning

no code implementations19 Jul 2022 Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

For the AC-based approach we provide convergence rate guarantees to a stationary point in the basic AC case and to a global optimum in the NAC case.

reinforcement-learning Reinforcement Learning (RL)

Improper Reinforcement Learning with Gradient-based Policy Optimization

no code implementations16 Feb 2021 Mohammadi Zaki, Avinash Mohan, Aditya Gopalan, Shie Mannor

We consider an improper reinforcement learning setting where a learner is given $M$ base controllers for an unknown Markov decision process, and wishes to combine them optimally to produce a potentially new controller that can outperform each of the base ones.

reinforcement-learning Reinforcement Learning (RL)

Towards Optimal and Efficient Best Arm Identification in Linear Bandits

no code implementations5 Nov 2019 Mohammadi Zaki, Avinash Mohan, Aditya Gopalan

We give a new algorithm for best arm identification in linearly parameterised bandits in the fixed confidence setting.

Cannot find the paper you are looking for? You can Submit a new open access paper.