no code implementations • 17 Apr 2024 • Yaqi Xie, Will Ma, Linwei Xin
Second, the number of parameters in a policy class may not be the correct measure of overfitting error: in fact, the class of policies defined by T time-varying base-stock levels exhibits a generalization error comparable to that of the two-parameter (s, S) policy class.
no code implementations • 16 Feb 2023 • Omar Besbes, Will Ma, Omar Mouchtaki
This class includes classical policies such as ERM, k-Nearest Neighbors and kernel-based policies.
no code implementations • 14 Oct 2022 • Jiashuo Jiang, Will Ma, Jiawei Zhang
We study the classical Network Revenue Management (NRM) problem with accept/reject decisions and $T$ IID arrivals.
no code implementations • 20 Jun 2022 • Omar Besbes, Will Ma, Omar Mouchtaki
We then leverage this connection to quantify the performance that is achievable by Sample Average Approximation (SAA) as a function of the radius of the heterogeneity ball: for any integral probability metric, we derive bounds depending on the approximation parameter, a notion which quantifies how the interplay between the heterogeneity and the problem structure impacts the performance of SAA.
no code implementations • 18 Sep 2021 • Will Ma, Pan Xu, Yifan Xu
Examples of online and offline agents include keywords (online) and sponsors (offline) in Google Advertising; workers (online) and tasks (offline) in Amazon Mechanical Turk (AMT); riders (online) and drivers (offline when restricted to a short time window) in ridesharing.
no code implementations • NeurIPS 2020 • Christian Tjandraatmadja, Ross Anderson, Joey Huchette, Will Ma, Krunal Patel, Juan Pablo Vielma
We improve the effectiveness of propagation- and linear-optimization-based neural network verification algorithms with a new tightened convex relaxation for ReLU neurons.
no code implementations • 11 Oct 2018 • Wang Chi Cheung, Will Ma, David Simchi-Levi, Xinshang Wang
We overcome both the challenges of model uncertainty and customer heterogeneity by judiciously synthesizing two algorithmic frameworks from the literature: inventory balancing, which "reserves" a portion of each resource for high-reward customer types which could later arrive, and online learning, which shows how to "explore" the resource consumption distributions of each customer type under different actions.