no code implementations • 16 Feb 2021 • Nima Hamidi, Mohsen Bayati
The elliptical potential lemma is a key tool for quantifying uncertainty in estimating parameters of the reward function, but it requires the noise and the prior distributions to be Gaussian.
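For reference, one common statement of the elliptical potential lemma (as it appears in the linear bandit literature; constants here follow a standard version and may differ from this paper's generalization) is:

```latex
\text{For } x_1,\dots,x_T \in \mathbb{R}^d \text{ with } \|x_t\| \le L,
\text{ and } V_t = \lambda I + \sum_{s=1}^{t} x_s x_s^\top,
\qquad
\sum_{t=1}^{T} \min\!\bigl(1, \|x_t\|_{V_{t-1}^{-1}}^2\bigr)
\;\le\; 2d \log\!\Bigl(1 + \tfrac{T L^2}{d\lambda}\Bigr).
```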
1 code implementation • NeurIPS 2020 • Mohsen Bayati, Nima Hamidi, Ramesh Johari, Khashayar Khosravi
We study the structure of regret-minimizing policies in the {\em many-armed} Bayesian multi-armed bandit problem: in particular, with $k$ the number of arms and $T$ the time horizon, we consider the case where $k \geq \sqrt{T}$.
no code implementations • 11 Jun 2020 • Nima Hamidi, Mohsen Bayati
This paper studies the stochastic linear bandit problem, where a decision-maker chooses actions from possibly time-dependent sets of vectors in $\mathbb{R}^d$ and receives noisy rewards.
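To make the setting concrete, here is a minimal simulation sketch of the interaction protocol described above, using a plain greedy policy with a ridge estimate; this is only an illustration of the problem setup, not the paper's algorithm, and all numerical choices (dimension, noise level, action-set size) are arbitrary:

```python
import numpy as np

# Hypothetical stochastic linear bandit loop: each round the learner picks a
# vector from a (time-dependent) action set and observes <theta, x> + noise.
rng = np.random.default_rng(0)
d, T, lam = 5, 2000, 1.0
theta = rng.normal(size=d)
theta /= np.linalg.norm(theta)          # unknown unit-norm parameter

V = lam * np.eye(d)                     # regularized Gram matrix
b = np.zeros(d)                         # running sum of r_t * x_t

for t in range(T):
    actions = rng.normal(size=(10, d))          # fresh action set each round
    theta_hat = np.linalg.solve(V, b)           # ridge estimate
    x = actions[np.argmax(actions @ theta_hat)] # greedy action choice
    r = x @ theta + rng.normal(scale=0.1)       # noisy reward
    V += np.outer(x, x)
    b += r * x

theta_hat = np.linalg.solve(V, b)
print(np.linalg.norm(theta_hat - theta))        # estimation error
```

Note that the randomly drawn action sets themselves supply covariate variation, which is why even the greedy policy's estimate improves here.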
2 code implementations • 24 Feb 2020 • Mohsen Bayati, Nima Hamidi, Ramesh Johari, Khashayar Khosravi
This finding diverges from the notion of free exploration, which relates to covariate variation, as recently discussed in the contextual bandit literature.
no code implementations • 12 Feb 2020 • Nima Hamidi, Mohsen Bayati
First, our new notion of optimism in expectation gives rise to a new algorithm, called sieved greedy (SG), which reduces the over-exploration problem in OFUL.
no code implementations • NeurIPS 2019 • Nima Hamidi, Mohsen Bayati, Kapil Gupta
We consider the $k$-armed stochastic contextual bandit problem with $d$-dimensional features, when both $k$ and $d$ can be large.
no code implementations • 16 Jun 2019 • Nima Hamidi, Mohsen Vahidzadeh, Stephen Baek
Convolutional neural networks (CNNs) have recently gained notable attention in a variety of machine learning tasks, including music classification and style tagging.
1 code implementation • 18 Apr 2019 • Nima Hamidi, Mohsen Bayati
In this paper, we study trace regression, in which a matrix of parameters $B^*$ is estimated via the convex relaxation of a rank-regularized regression or via regularized non-convex optimization.
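As background for the convex relaxation mentioned above: replacing the rank penalty with the nuclear norm yields a convex problem whose proximal operator is singular-value soft-thresholding, the core step of proximal-gradient solvers. A minimal sketch of that step (an illustration of the standard technique, not the paper's estimator):

```python
import numpy as np

def svt(M, tau):
    """Singular-value soft-thresholding: prox of tau * nuclear norm."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    s_shrunk = np.maximum(s - tau, 0.0)   # shrink each singular value
    return U @ np.diag(s_shrunk) @ Vt

# Toy input: shrinking can only decrease singular values (and hence rank).
rng = np.random.default_rng(1)
B = rng.normal(size=(6, 4))
B_shrunk = svt(B, tau=1.0)
print(np.linalg.matrix_rank(B_shrunk) <= np.linalg.matrix_rank(B))
```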