Search Results for author: Tesi Xiao

Found 10 papers, 2 papers with code

A Sinkhorn-type Algorithm for Constrained Optimal Transport

no code implementations8 Mar 2024 Xun Tang, Holakou Rahmanian, Michael Shavlovsky, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying

We derive the corresponding entropy regularization formulation and introduce a Sinkhorn-type algorithm for such constrained OT problems supported by theoretical guarantees.

Scheduling

Accelerating Sinkhorn Algorithm with Sparse Newton Iterations

no code implementations20 Jan 2024 Xun Tang, Michael Shavlovsky, Holakou Rahmanian, Elisa Tardini, Kiran Koshy Thekumparampil, Tesi Xiao, Lexing Ying

To achieve possibly super-exponential convergence, we present Sinkhorn-Newton-Sparse (SNS), an extension to the Sinkhorn algorithm, by introducing early stopping for the matrix scaling steps and a second stage featuring a Newton-type subroutine.

Multi-Objective Optimization via Wasserstein-Fisher-Rao Gradient Flow

no code implementations22 Nov 2023 Yinuo Ren, Tesi Xiao, Tanmay Gangwani, Anshuka Rangi, Holakou Rahmanian, Lexing Ying, Subhajit Sanyal

Multi-objective optimization (MOO) aims to optimize multiple, possibly conflicting objectives with widespread applications.

Optimal Algorithms for Stochastic Bilevel Optimization under Relaxed Smoothness Conditions

no code implementations21 Jun 2023 Xuxing Chen, Tesi Xiao, Krishnakumar Balasubramanian

In this paper, we introduce a novel fully single-loop and Hessian-inversion-free algorithmic framework for stochastic bilevel optimization and present a tighter analysis under standard smoothness assumptions (first-order Lipschitzness of the UL function and second-order Lipschitzness of the LL function).

Bilevel Optimization

A One-Sample Decentralized Proximal Algorithm for Non-Convex Stochastic Composite Optimization

1 code implementation20 Feb 2023 Tesi Xiao, Xuxing Chen, Krishnakumar Balasubramanian, Saeed Ghadimi

We focus on decentralized stochastic non-convex optimization, where $n$ agents work together to optimize a composite objective function which is a sum of a smooth term and a non-smooth convex term.

Field-wise Embedding Size Search via Structural Hard Auxiliary Mask Pruning for Click-Through Rate Prediction

no code implementations17 Aug 2022 Tesi Xiao, Xia Xiao, Ming Chen, Youlong Chen

However, most existing NAS-based works suffer from expensive computational costs, the curse of dimensionality of the search space, and the discrepancy between continuous search space and discrete candidate space.

Click-Through Rate Prediction Neural Architecture Search

A Projection-free Algorithm for Constrained Stochastic Multi-level Composition Optimization

no code implementations9 Feb 2022 Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

We propose a projection-free conditional gradient-type algorithm for smooth stochastic multi-level composition optimization, where the objective function is a nested composition of $T$ functions and the constraint set is a closed convex set.

Statistical Inference for Polyak-Ruppert Averaged Zeroth-order Stochastic Gradient Algorithm

no code implementations10 Feb 2021 Yanhao Jin, Tesi Xiao, Krishnakumar Balasubramanian

Statistical machine learning models trained with stochastic gradient algorithms are increasingly being deployed in critical scientific applications.

BIG-bench Machine Learning valid

Improved Complexities for Stochastic Conditional Gradient Methods under Interpolation-like Conditions

no code implementations15 Jun 2020 Tesi Xiao, Krishnakumar Balasubramanian, Saeed Ghadimi

We analyze stochastic conditional gradient methods for constrained optimization problems arising in over-parametrized machine learning.

BIG-bench Machine Learning

Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise

1 code implementation5 Jun 2019 Xuanqing Liu, Tesi Xiao, Si Si, Qin Cao, Sanjiv Kumar, Cho-Jui Hsieh

In this paper, we propose a new continuous neural network framework called Neural Stochastic Differential Equation (Neural SDE) network, which naturally incorporates various commonly used regularization mechanisms based on random noise injection.

Cannot find the paper you are looking for? You can Submit a new open access paper.