Search Results for author: Bingshan Hu

Found 3 papers, 0 papers with code

Efficient and Adaptive Posterior Sampling Algorithms for Bandits

no code implementations • 2 May 2024 • Bingshan Hu, Zhiming Huang, Tianyue H. Zhang, Mathias Lécuyer, Nidhi Hegde

We study Thompson Sampling-based algorithms for stochastic bandits with bounded rewards.

Paper
Add Code

Near-Optimal Algorithms for Private Online Learning in a Stochastic Environment

no code implementations • 16 Feb 2021 • Bingshan Hu, Zhiming Huang, Nishant A. Mehta

Specifically, for the problem of decision-theoretic online learning with stochastic rewards, we present the first algorithm that achieves an $ O \left( \frac{ \log K}{ \Delta_{\min}} + \frac{\log(K) \min\{\log (\frac{1}{\Delta_{\min}}), \log(T)\}}{\epsilon} \right)$ regret bound, where $\Delta_{\min}$ is the minimum mean reward gap.

Paper
Add Code

Thompson Sampling for Combinatorial Semi-bandits with Sleeping Arms and Long-Term Fairness Constraints

no code implementations • 14 May 2020 • Zhiming Huang, Yifan Xu, Bingshan Hu, QiPeng Wang, Jianping Pan

We study the combinatorial sleeping multi-armed semi-bandit problem with long-term fairness constraints~(CSMAB-F).

Fairness Movie Recommendation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.