Search Results for author: Yassir Jedra

Found 11 papers, 1 papers with code

Low-Rank Bandits via Tight Two-to-Infinity Singular Subspace Recovery

no code implementations24 Feb 2024 Yassir Jedra, William Réveillard, Stefan Stojanovic, Alexandre Proutiere

For policy evaluation and best policy identification, we show that our algorithms are nearly minimax optimal.

Multi-Armed Bandits

Constrained Deep Reinforcement Learning for Fronthaul Compression Optimization

no code implementations26 Sep 2023 Axel Grönland, Alessio Russo, Yassir Jedra, Bleron Klaiqi, Xavier Gelabert

In the Centralized-Radio Access Network (C-RAN) architecture, functions can be placed in the central or distributed locations.

Quantization reinforcement-learning

A Tutorial on the Non-Asymptotic Theory of System Identification

no code implementations7 Sep 2023 Ingvar Ziemann, Anastasios Tsiamis, Bruce Lee, Yassir Jedra, Nikolai Matni, George J. Pappas

This tutorial serves as an introduction to recently developed non-asymptotic methods in the theory of -- mainly linear -- system identification.

Exploiting Observation Bias to Improve Matrix Completion

no code implementations7 Jun 2023 Yassir Jedra, Sean Mann, Charlotte Park, Devavrat Shah

Instead of treating this observation bias as a disadvantage, as is typically the case, the goal is to exploit the shared information between the bias and the outcome of interest to improve predictions.

Matrix Completion

Nearly Optimal Latent State Decoding in Block MDPs

1 code implementation17 Aug 2022 Yassir Jedra, Junghyun Lee, Alexandre Proutière, Se-Young Yun

We investigate the problems of model estimation and reward-free learning in episodic Block MDPs.

Best Policy Identification in Linear MDPs

no code implementations11 Aug 2022 Jerome Taupin, Yassir Jedra, Alexandre Proutiere

We investigate the problem of best policy identification in discounted linear Markov Decision Processes in the fixed confidence setting under a generative model.

Learning Optimal Antenna Tilt Control Policies: A Contextual Linear Bandit Approach

no code implementations6 Jan 2022 Filippo Vannella, Alexandre Proutiere, Yassir Jedra, Jaeseong Jeong

In this paper, we devise algorithms learning optimal tilt control policies from existing data (in the so-called passive learning setting) or from data actively generated by the algorithms (the active learning setting).

Active Learning

Minimal Expected Regret in Linear Quadratic Control

no code implementations29 Sep 2021 Yassir Jedra, Alexandre Proutiere

Quantifying the impact of such a constantly-varying control policy on the performance of these estimates and on the regret constitutes one of the technical challenges tackled in this paper.

Optimal Best-arm Identification in Linear Bandits

no code implementations NeurIPS 2020 Yassir Jedra, Alexandre Proutiere

We study the problem of best-arm identification with fixed confidence in stochastic linear bandits.

Finite-time Identification of Stable Linear Systems: Optimality of the Least-Squares Estimator

no code implementations17 Mar 2020 Yassir Jedra, Alexandre Proutiere

We present a new finite-time analysis of the estimation error of the Ordinary Least Squares (OLS) estimator for stable linear time-invariant systems.

Sample Complexity Lower Bounds for Linear System Identification

no code implementations25 Mar 2019 Yassir Jedra, Alexandre Proutiere

For controlled systems, our lower bounds are not as explicit as in the case of uncontrolled systems, but could well provide interesting insights into the design of control policy with minimal sample complexity.

valid

Cannot find the paper you are looking for? You can Submit a new open access paper.