no code implementations • 30 Mar 2024 • Haijie Xu, Xiaochen Xian, Chen Zhang, Kaibo Liu
Meanwhile, by treating the detection power as a reward, its connection with the online combinatorial multi-armed bandit (CMAB) problem is formulated and an adaptive upper confidence region algorithm is proposed for adaptive sampling policy design.
no code implementations • 30 Mar 2024 • Haijie Xu, Chen Zhang
Contrasts with existing works which all consider nodes as functions and use edges to represent the relationships between different functions.