Search Results for author: Peter Glynn

Found 13 papers, 0 papers with code

Optimal Sample Complexity for Average Reward Markov Decision Processes

no code implementations13 Oct 2023 Shengbo Wang, Jose Blanchet, Peter Glynn

In this context, the existing literature provides a sample complexity upper bound of $\widetilde O(|S||A|t_{\text{mix}}^2 \epsilon^{-2})$ and a lower bound of $\Omega(|S||A|t_{\text{mix}} \epsilon^{-2})$.

Optimal Sample Complexity of Reinforcement Learning for Mixing Discounted Markov Decision Processes

no code implementations15 Feb 2023 Shengbo Wang, Jose Blanchet, Peter Glynn

We consider the optimal sample complexity theory of tabular reinforcement learning (RL) for maximizing the infinite horizon discounted reward in a Markov decision process (MDP).

reinforcement-learning Reinforcement Learning (RL)

The Design and Implementation of a Broadly Applicable Algorithm for Optimizing Intra-Day Surgical Scheduling

no code implementations14 Mar 2022 Jin Xie, Teng Zhang, Jose Blanchet, Peter Glynn, Matthew Randolph, David Scheinker

In order for an algorithm to see sustained use, it must be compatible with changes to hospital capacity, patient volumes, and scheduling practices.

Scheduling

Surgical Scheduling via Optimization and Machine Learning with Long-Tailed Data

no code implementations13 Feb 2022 Yuan Shi, Saied Mahdian, Jose Blanchet, Peter Glynn, Andrew Y. Shin, David Scheinker

Using data from cardiovascular surgery patients with long and highly variable post-surgical lengths of stay (LOS), we develop a modeling framework to reduce recovery unit congestion.

BIG-bench Machine Learning Scheduling +1

Optimal $δ$-Correct Best-Arm Selection for Heavy-Tailed Distributions

no code implementations24 Aug 2019 Shubhada Agrawal, Sandeep Juneja, Peter Glynn

We then propose a $\delta$-correct algorithm that matches the lower bound as $\delta$ reduces to zero under the mild restriction that a known bound on the expectation of $(1+\epsilon)^{th}$ moment of the underlying random variables exists, for $\epsilon > 0$.

Recommendation Systems

Optimal Transport Relaxations with Application to Wasserstein GANs

no code implementations7 Jun 2019 Saied Mahdian, Jose Blanchet, Peter Glynn

We propose a family of relaxations of the optimal transport problem which regularize the problem by introducing an additional minimization step over a small region around one of the underlying transporting measures.

Probability Functional Descent: A Unifying Perspective on GANs, Variational Inference, and Reinforcement Learning

no code implementations30 Jan 2019 Casey Chu, Jose Blanchet, Peter Glynn

This paper provides a unifying view of a wide range of problems of interest in machine learning by framing them as the minimization of functionals defined on the space of probability measures.

BIG-bench Machine Learning reinforcement-learning +2

Distributed Asynchronous Optimization with Unbounded Delays: How Slow Can You Go?

no code implementations ICML 2018 Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Peter Glynn, Yinyu Ye, Li-Jia Li, Li Fei-Fei

One of the most widely used optimization methods for large-scale machine learning problems is distributed asynchronous stochastic gradient descent (DASGD).

An Accelerated Approach to Safely and Efficiently Test Pre-Production Autonomous Vehicles on Public Streets

no code implementations5 May 2018 Mansur Arief, Peter Glynn, Ding Zhao

Various automobile and mobility companies, for instance Ford, Uber and Waymo, are currently testing their pre-produced autonomous vehicle (AV) fleets on the public roads.

Autonomous Vehicles

Probabilistic Contraction Analysis of Iterated Random Operators

no code implementations4 Apr 2018 Abhishek Gupta, Rahul Jain, Peter Glynn

In many branches of engineering, Banach contraction mapping theorem is employed to establish the convergence of certain deterministic algorithms.

On the convergence of mirror descent beyond stochastic convex programming

no code implementations18 Jun 2017 Zhengyuan Zhou, Panayotis Mertikopoulos, Nicholas Bambos, Stephen Boyd, Peter Glynn

In this paper, we examine the convergence of mirror descent in a class of stochastic optimization problems that are not necessarily convex (or even quasi-convex), and which we call variationally coherent.

Stochastic Optimization

Statistics of Robust Optimization: A Generalized Empirical Likelihood Approach

no code implementations11 Oct 2016 John Duchi, Peter Glynn, Hongseok Namkoong

We study statistical inference and distributionally robust solution methods for stochastic optimization problems, focusing on confidence intervals for optimal values and solutions that achieve exact coverage asymptotically.

Stochastic Optimization

Selecting the best system and multi-armed bandits

no code implementations16 Jul 2015 Peter Glynn, Sandeep Juneja

Consider the problem of finding a population or a probability distribution amongst many with the largest mean when these means are unknown but population samples can be simulated or otherwise generated.

Multi-Armed Bandits

Cannot find the paper you are looking for? You can Submit a new open access paper.