no code implementations • 16 Sep 2020 • Alexandre Letard, Tassadit Amghar, Olivier Camp, Nicolas Gutowski
Nevertheless, this implicit feedback can be misleading or inefficient for the agent's learning.
Multi-Armed Bandits Recommendation Systems +1