no code implementations • 15 Apr 2024 • Dylan J. Foster, Yanjun Han, Jian Qian, Alexander Rakhlin
Our main results settle the statistical and computational complexity of online estimation in this framework.
no code implementations • 3 Apr 2022 • Ali Jadbabaie, Haochuan Li, Jian Qian, Yi Tian
In this paper, we study a linear bandit optimization problem in a federated setting where a large collection of distributed agents collaboratively learn a common linear bandit model.
no code implementations • 27 Dec 2021 • Dylan J. Foster, Sham M. Kakade, Jian Qian, Alexander Rakhlin
The main result of this work provides a complexity measure, the Decision-Estimation Coefficient, that is proven to be both necessary and sufficient for sample-efficient interactive learning.
no code implementations • 1 Mar 2021 • Avrim Blum, Steve Hanneke, Jian Qian, Han Shao
We study the problem of robust learning under clean-label data-poisoning attacks, where the attacker injects (an arbitrary set of) correctly labeled examples into the training set to fool the algorithm into making mistakes on specific test instances.
no code implementations • 15 Oct 2020 • Xuedong Shang, Han Shao, Jian Qian
We study two goals: (a) finding the arm with the minimum $\ell^\infty$-norm of relative losses with a given confidence level (which refers to fixed-confidence best-arm identification); (b) minimizing the $\ell^\infty$-norm of cumulative relative losses (which refers to regret minimization).
no code implementations • NeurIPS 2020 • Yi Tian, Jian Qian, Suvrit Sra
We study minimax optimal reinforcement learning in episodic factored Markov decision processes (FMDPs), which are MDPs with conditionally independent transition components.
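The defining property of an FMDP is that the next-state distribution factorizes over conditionally independent components. A minimal sketch of that factorization (the component tables and scopes below are illustrative, not from the paper):

```python
import numpy as np

# Factored transition sketch: two binary state components, each of whose next
# value depends only on its own current value (a trivial scope, for illustration).
P1 = np.array([[0.8, 0.2], [0.3, 0.7]])  # P(s1' | s1)
P2 = np.array([[0.6, 0.4], [0.1, 0.9]])  # P(s2' | s2)

def transition_prob(s, s_next):
    """Joint transition probability under conditional independence."""
    s1, s2 = s
    t1, t2 = s_next
    return P1[s1, t1] * P2[s2, t2]  # product over components

print(transition_prob((0, 1), (1, 0)))  # 0.2 * 0.1 = 0.02
```

The factorization is what makes sample-efficient learning possible: each component table has far fewer parameters than the full joint transition matrix.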
no code implementations • 30 Jan 2020 • Jian Qian, Ronan Fruit, Matteo Pirotta, Alessandro Lazaric
We investigate concentration inequalities for Dirichlet and Multinomial random variables.
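A quick empirical illustration of the kind of statement such inequalities make, using the classical Weissman-style L1 deviation bound for the multinomial MLE (the specific distribution and constants below are illustrative; this is not the paper's bound):

```python
import numpy as np

rng = np.random.default_rng(0)
p = np.array([0.5, 0.3, 0.2])   # true multinomial distribution (illustrative)
n, k, trials, eps = 500, 3, 2000, 0.1

# Empirical frequency of large L1 deviations of the MLE p_hat from p.
counts = rng.multinomial(n, p, size=trials)
p_hat = counts / n
dev = np.abs(p_hat - p).sum(axis=1)
emp = (dev >= eps).mean()

# Weissman et al. bound: P(||p_hat - p||_1 >= eps) <= (2^k - 2) exp(-n eps^2 / 2)
bound = (2**k - 2) * np.exp(-n * eps**2 / 2)
print(emp, bound)  # empirical deviation probability vs. the bound
```

The empirical deviation frequency should sit well below the bound, which is the pattern such concentration results formalize.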
1 code implementation • NeurIPS 2019 • Jian Qian, Ronan Fruit, Matteo Pirotta, Alessandro Lazaric
The exploration bonus is an effective approach to managing the exploration-exploitation trade-off in Markov Decision Processes (MDPs).
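The general idea can be sketched in a toy two-armed setting: add an optimistic bonus, shrinking with the visit count, to the empirical mean reward, and act greedily on the sum. This is a generic UCB-style sketch, not the paper's specific bonus:

```python
import numpy as np

rng = np.random.default_rng(1)
true_means = np.array([0.4, 0.6])  # Bernoulli reward means (illustrative)
N = np.zeros(2)                    # visit counts per action
S = np.zeros(2)                    # reward sums per action
c, T = 1.0, 2000

for t in range(1, T + 1):
    mean = S / np.maximum(N, 1)
    bonus = c * np.sqrt(np.log(t + 1) / np.maximum(N, 1))  # exploration bonus
    a = int(np.argmax(mean + bonus))                       # optimism in the face of uncertainty
    S[a] += rng.random() < true_means[a]
    N[a] += 1

print(N)  # the better arm accumulates far more pulls
```

Because the bonus decays as an action is visited, under-explored actions look optimistic and get tried, while well-estimated suboptimal actions are eventually abandoned.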
2 code implementations • NeurIPS 2019 • Matthew Schlegel, Wesley Chung, Daniel Graves, Jian Qian, Martha White
Importance sampling (IS) is a common reweighting strategy for off-policy prediction in reinforcement learning.
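The textbook IS reweighting can be sketched as follows: samples generated by a behavior policy are reweighted by the ratio of target to behavior probabilities to estimate the target policy's expected return (a standard sketch of ordinary IS, not the paper's proposed method; policies and rewards below are illustrative):

```python
import numpy as np

rng = np.random.default_rng(2)
pi = np.array([0.9, 0.1])      # target policy over two actions (illustrative)
b  = np.array([0.5, 0.5])      # behavior policy that generated the data
r_mean = np.array([1.0, 0.0])  # expected reward of each action

n = 100_000
a = rng.choice(2, size=n, p=b)             # actions drawn from the behavior policy
r = r_mean[a] + rng.normal(0, 0.1, size=n)  # noisy rewards
rho = pi[a] / b[a]                          # importance ratios

is_est = float(np.mean(rho * r))   # ordinary IS estimate of E_pi[R]
true_val = float(pi @ r_mean)      # ground truth, 0.9
print(is_est, true_val)
```

The estimate is unbiased but its variance grows with the mismatch between the two policies, which is the practical difficulty motivating reweighting alternatives.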
no code implementations • 11 Dec 2018 • Jian Qian, Ronan Fruit, Matteo Pirotta, Alessandro Lazaric
We introduce and analyse two algorithms for exploration-exploitation in discrete and continuous Markov Decision Processes (MDPs) based on exploration bonuses.