no code implementations • 22 Nov 2010 • Yi Gai, Bhaskar Krishnamachari, Rahul Jain
Furthermore, these policies only require storage that grows linearly in the number of unknown parameters.
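As a rough illustration of the linear-storage claim, the sketch below keeps one running mean and one observation count per unknown parameter, so memory depends on the number of parameters and not on the number of possible actions. The class name `LinearStoragePolicy`, its methods, and the generic UCB1-style bonus are assumptions made for illustration; they are not taken from the paper and should not be read as the authors' exact index.

```python
import math


class LinearStoragePolicy:
    """Hypothetical UCB-style learner storing two numbers per unknown parameter."""

    def __init__(self, num_params):
        self.means = [0.0] * num_params  # running mean of observations per parameter
        self.counts = [0] * num_params   # number of observations per parameter
        self.t = 0                       # rounds played so far

    def index(self, i):
        """Optimistic estimate for parameter i (generic UCB1-style bonus, assumed)."""
        if self.counts[i] == 0:
            return math.inf  # unobserved parameters are tried first
        return self.means[i] + math.sqrt(2.0 * math.log(self.t) / self.counts[i])

    def update(self, observed):
        """observed: dict mapping parameter index -> value seen this round."""
        self.t += 1
        for i, x in observed.items():
            self.counts[i] += 1
            # Incremental mean update; no history of past observations is kept.
            self.means[i] += (x - self.means[i]) / self.counts[i]

    def storage_size(self):
        """Number of stored scalars, excluding the round counter: 2 * num_params."""
        return len(self.means) + len(self.counts)
```

In a combinatorial setting, an action would typically be chosen by passing these per-parameter indices as weights to a deterministic combinatorial solver; that step is omitted here because it depends on the specific problem.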
Combinatorial Optimization • Multi-Armed Bandits