Search Results for author: Nicklas Werge

Found 8 papers, 1 papers with code

Probabilistic Actor-Critic: Learning to Explore with PAC-Bayes Uncertainty

no code implementations • 5 Feb 2024 • Bahareh Tasdighi, Nicklas Werge, Yi-Shan Wu, Melih Kandemir

We introduce Probabilistic Actor-Critic (PAC), a novel reinforcement learning algorithm with improved continuous control performance thanks to its ability to mitigate the exploration-exploitation trade-off.

Continuous Control Decision Making +1

Paper
Add Code

Demystifying the Myths and Legends of Nonconvex Convergence of SGD

no code implementations • 19 Oct 2023 • Aritra Dutta, El Houcine Bergou, Soumia Boucherouite, Nicklas Werge, Melih Kandemir, Xin Li

Additionally, our analyses allow us to measure the density of the $\epsilon$-stationary points in the final iterates of SGD, and we recover the classical $O(\frac{1}{\sqrt{T}})$ asymptotic rate under various existing assumptions on the objective function and the bounds on the stochastic gradient.

Paper
Add Code

BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits

no code implementations • 7 Jul 2023 • Nicklas Werge, Abdullah Akgül, Melih Kandemir

We propose a novel Bayesian-Optimistic Frequentist Upper Confidence Bound (BOF-UCB) algorithm for stochastic contextual linear bandits in non-stationary environments.

Decision Making Multi-Armed Bandits

Paper
Add Code

An adaptive volatility method for probabilistic forecasting and its application to the M6 financial forecasting competition

no code implementations • 3 Mar 2023 • Joseph de Vilmarest, Nicklas Werge

In this note, we address the problem of probabilistic forecasting using an adaptive volatility method based on classical time-varying volatility models and stochastic optimization algorithms.

Decision Making Stochastic Optimization

Paper
Add Code

Learning from time-dependent streaming data with online stochastic algorithms

no code implementations • 25 May 2022 • Antoine Godichon-Baggioni, Nicklas Werge, Olivier Wintenberger

This paper addresses stochastic optimization in a streaming setting with time-dependent and biased gradient estimates.

Stochastic Optimization

Paper
Add Code

Non-Asymptotic Analysis of Stochastic Approximation Algorithms for Streaming Data

no code implementations • 15 Sep 2021 • Antoine Godichon-Baggioni, Nicklas Werge, Olivier Wintenberger

We provide non-asymptotic convergence rates of various gradient-based algorithms; this includes the famous Stochastic Gradient (SG) descent (a. k. a.

Paper
Add Code

Predicting Risk-adjusted Returns using an Asset Independent Regime-switching Model

no code implementations • 7 Jul 2021 • Nicklas Werge

Financial markets tend to switch between various market regimes over time, making stationarity-based models unsustainable.

Paper
Add Code

AdaVol: An Adaptive Recursive Volatility Prediction Method

1 code implementation • 3 Jun 2020 • Nicklas Werge, Olivier Wintenberger

An investigation of the convergence properties of the QML procedure in a general conditionally heteroscedastic time series model is conducted, and the classical batch optimization routines extended to the framework of streaming and large-scale problems.

Time Series Time Series Analysis

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.