Search Results for author: Xuheng Li

Found 2 papers, 0 papers with code

Feel-Good Thompson Sampling for Contextual Dueling Bandits

no code implementations • 9 Apr 2024 • Xuheng Li, Heyang Zhao, Quanquan Gu

In this paper, we propose a Thompson sampling algorithm, named FGTS.CDB, for linear contextual dueling bandits.

Decision Making Multi-Armed Bandits +1

Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

no code implementations • 23 Nov 2023 • Xuheng Li, Yihe Deng, Jingfeng Wu, Dongruo Zhou, Quanquan Gu

Additionally, when our analysis is specialized to linear regression in the strongly convex setting, it yields a tighter bound for bias error than the best-known result.

regression
