no code implementations • 5 Feb 2024 • Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir
We introduce Probabilistic Actor-Critic (PAC), a novel reinforcement learning algorithm with improved continuous control performance thanks to its ability to mitigate the exploration-exploitation trade-off.
no code implementations • 2 Oct 2023 • Yijie Zhang, Yi-Shan Wu, Luis A. Ortega, Andrés R. Masegosa
The cold posterior effect (CPE) (Wenzel et al., 2020) in Bayesian deep learning shows that, for posteriors with a temperature $T<1$, the resulting posterior predictive could have better performances than the Bayesian posterior ($T=1$).
1 code implementation • 1 Jun 2022 • Yi-Shan Wu, Yevgeny Seldin
We present a new concentration of measure inequality for sums of independent bounded random variables, which we name a split-kl inequality.
1 code implementation • NeurIPS 2021 • Yi-Shan Wu, Andrés R. Masegosa, Stephan S. Lorenzen, Christian Igel, Yevgeny Seldin
The bound is based on a novel parametric form of the Chebyshev- Cantelli inequality (a. k. a.