1 code implementation • 20 Jun 2023 • Connor James Stephens, Emmanuel Blazquez
We present a novel approach to system identification (SI) using deep learning techniques.
no code implementations • 30 Oct 2022 • Yao Zhao, Connor James Stephens, Csaba Szepesvári, Kwang-Sung Jun
Simple regret is a natural and parameter-free performance criterion for pure exploration in multi-armed bandits yet is less popular than the probability of missing the best arm or an $\epsilon$-good arm, perhaps due to lack of easy ways to characterize it.