Search Results for author: Roshan Shariff

Found 6 papers, 0 papers with code

Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have

no code implementations1 Dec 2022 Nadia M. Ady, Roshan Shariff, Johannes Günther, Patrick M. Pilarski

As a second main contribution of this work, we show how these properties may be implemented together in a proof-of-concept reinforcement learning agent: we demonstrate how the properties manifest in the behaviour of this agent in a simple non-episodic grid-world environment that includes curiosity-inducing locations and induced targets of curiosity.

Decision Making reinforcement-learning +1

Efficient Planning in Large MDPs with Weak Linear Function Approximation

no code implementations NeurIPS 2020 Roshan Shariff, Csaba Szepesvári

Large-scale Markov decision processes (MDPs) require planning algorithms with runtime independent of the number of states of the MDP.

Discounted Reinforcement Learning Is Not an Optimization Problem

no code implementations4 Oct 2019 Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton

Discounted reinforcement learning is fundamentally incompatible with function approximation for control in continuing tasks.

Misconceptions reinforcement-learning +1

Differentially Private Contextual Linear Bandits

no code implementations NeurIPS 2018 Roshan Shariff, Or Sheffet

We study the contextual linear bandit problem, a version of the standard stochastic multi-armed bandit (MAB) problem where a learner sequentially selects actions to maximize a reward which depends also on a user provided per-round context.

Conservative Bandits

no code implementations13 Feb 2016 Yifan Wu, Roshan Shariff, Tor Lattimore, Csaba Szepesvári

We consider both the stochastic and the adversarial settings, where we propose, natural, yet novel strategies and analyze the price for maintaining the constraints.

Cannot find the paper you are looking for? You can Submit a new open access paper.