Search Results for author: Daniel Visentin

Found 1 papers, 0 papers with code

Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions

no code implementations • 3 Dec 2015 • Peter Sunehag, Richard Evans, Gabriel Dulac-Arnold, Yori Zwols, Daniel Visentin, Ben Coppin

Further, we use deep deterministic policy gradients to learn a policy that for each position of the slate, guides attention towards the part of the action space in which the value is the highest and we only evaluate actions in this area.

Q-Learning Recommendation Systems

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.