Search Results for author: Nischal Agrawal

Found 1 paper, 0 papers with code

KLUCB Approach to Copeland Bandits

no code implementations • 7 Feb 2019 • Nischal Agrawal, Prasanna Chaporkar

The multi-armed bandit (MAB) problem is a reinforcement learning framework in which an agent tries to maximise her profit by selecting actions well, using the absolute feedback she receives for each action.
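As a rough illustration of the absolute-feedback setting described in this snippet, the sketch below plays a Bernoulli bandit with the standard UCB1 index. It is not the KL-UCB approach named in the paper's title, and the function name `run_ucb1`, the arm means, and the horizon are all hypothetical values chosen for the example.

```python
import math
import random


def run_ucb1(arm_means, horizon, seed=0):
    """Play a Bernoulli bandit with UCB1 and return per-arm pull counts.

    arm_means are hypothetical success probabilities, unknown to the agent.
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    pulls = [0] * n_arms      # number of times each arm was chosen
    totals = [0.0] * n_arms   # summed reward collected from each arm

    for t in range(1, horizon + 1):
        if t <= n_arms:
            # Pull every arm once so each empirical mean is defined.
            arm = t - 1
        else:
            # Pick the arm with the largest empirical mean plus exploration bonus.
            arm = max(
                range(n_arms),
                key=lambda a: totals[a] / pulls[a]
                + math.sqrt(2.0 * math.log(t) / pulls[a]),
            )
        # Absolute feedback: the reward depends only on the chosen arm.
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        pulls[arm] += 1
        totals[arm] += reward
    return pulls


# Assumed example: three arms, the last one being the most profitable.
pulls = run_ucb1(arm_means=[0.2, 0.5, 0.8], horizon=5000)
print(pulls)
```

Over a long enough horizon the pull counts should concentrate on the arm with the highest mean, which is how the agent maximises her cumulative reward.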

