no code implementations • 5 Feb 2024 • Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir
We introduce Probabilistic Actor-Critic (PAC), a novel reinforcement learning algorithm with improved continuous control performance thanks to its ability to mitigate the exploration-exploitation trade-off.
no code implementations • 19 Oct 2023 • Aritra Dutta, El Houcine Bergou, Soumia Boucherouite, Nicklas Werge, Melih Kandemir, Xin Li
Additionally, our analyses allow us to measure the density of the $\epsilon$-stationary points in the final iterates of SGD, and we recover the classical $O(\frac{1}{\sqrt{T}})$ asymptotic rate under various existing assumptions on the objective function and the bounds on the stochastic gradient.
no code implementations • 7 Jul 2023 • Nicklas Werge, Abdullah Akgül, Melih Kandemir
We propose a novel Bayesian-Optimistic Frequentist Upper Confidence Bound (BOF-UCB) algorithm for stochastic contextual linear bandits in non-stationary environments.
no code implementations • 3 Mar 2023 • Joseph de Vilmarest, Nicklas Werge
In this note, we address the problem of probabilistic forecasting using an adaptive volatility method based on classical time-varying volatility models and stochastic optimization algorithms.
no code implementations • 25 May 2022 • Antoine Godichon-Baggioni, Nicklas Werge, Olivier Wintenberger
This paper addresses stochastic optimization in a streaming setting with time-dependent and biased gradient estimates.
no code implementations • 15 Sep 2021 • Antoine Godichon-Baggioni, Nicklas Werge, Olivier Wintenberger
We provide non-asymptotic convergence rates of various gradient-based algorithms; this includes the famous Stochastic Gradient (SG) descent (a. k. a.
no code implementations • 7 Jul 2021 • Nicklas Werge
Financial markets tend to switch between various market regimes over time, making stationarity-based models unsustainable.
1 code implementation • 3 Jun 2020 • Nicklas Werge, Olivier Wintenberger
An investigation of the convergence properties of the QML procedure in a general conditionally heteroscedastic time series model is conducted, and the classical batch optimization routines extended to the framework of streaming and large-scale problems.