Search Results for author: Roshan Shariff

Found 6 papers, 0 papers with code

Five Properties of Specific Curiosity You Didn't Know Curious Machines Should Have

no code implementations • 1 Dec 2022 • Nadia M. Ady, Roshan Shariff, Johannes Günther, Patrick M. Pilarski

As a second main contribution of this work, we show how these properties may be implemented together in a proof-of-concept reinforcement learning agent: we demonstrate how the properties manifest in the behaviour of this agent in a simple non-episodic grid-world environment that includes curiosity-inducing locations and induced targets of curiosity.

Decision Making reinforcement-learning +1

Paper
Add Code

Prototyping three key properties of specific curiosity in computational reinforcement learning

no code implementations • 20 May 2022 • Nadia M. Ady, Roshan Shariff, Johannes Günther, Patrick M. Pilarski

Curiosity for machine agents has been a focus of intense research.

Decision Making reinforcement-learning +1

Paper
Add Code

Efficient Planning in Large MDPs with Weak Linear Function Approximation

no code implementations • NeurIPS 2020 • Roshan Shariff, Csaba Szepesvári

Large-scale Markov decision processes (MDPs) require planning algorithms with runtime independent of the number of states of the MDP.

Paper
Add Code

Discounted Reinforcement Learning Is Not an Optimization Problem

no code implementations • 4 Oct 2019 • Abhishek Naik, Roshan Shariff, Niko Yasui, Hengshuai Yao, Richard S. Sutton

Discounted reinforcement learning is fundamentally incompatible with function approximation for control in continuing tasks.

Misconceptions reinforcement-learning +1

Paper
Add Code

Differentially Private Contextual Linear Bandits

no code implementations • NeurIPS 2018 • Roshan Shariff, Or Sheffet

We study the contextual linear bandit problem, a version of the standard stochastic multi-armed bandit (MAB) problem where a learner sequentially selects actions to maximize a reward which depends also on a user provided per-round context.

Paper
Add Code

Conservative Bandits

no code implementations • 13 Feb 2016 • Yifan Wu, Roshan Shariff, Tor Lattimore, Csaba Szepesvári

We consider both the stochastic and the adversarial settings, where we propose, natural, yet novel strategies and analyze the price for maintaining the constraints.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.