no code implementations • 9 Apr 2024 • Xuheng Li, Heyang Zhao, Quanquan Gu
In this paper, we propose a Thompson sampling algorithm, named FGTS.CDB, for linear contextual dueling bandits.
no code implementations • 23 Nov 2023 • Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu
Additionally, when our analysis is specialized to linear regression in the strongly convex setting, it yields a tighter bound on the bias error than the best-known result.