3 Nov 2023 • Qianxin Yi, Yiyang Yang, Shaojie Tang, Jiapeng Liu, Yao Wang
In this paper, we aim to build a novel bandit algorithm that fully harnesses the power of multi-dimensional data and the inherent non-linearity of reward functions to provide highly usable and accountable decision-making services.