Search Results for author: Samy Wu Fung

Found 15 papers, 14 papers with code

Structured World Representations in Maze-Solving Transformers

1 code implementation5 Dec 2023 Michael Igorevich Ivanitskiy, Alex F. Spies, Tilman Räuker, Guillaume Corlouer, Chris Mathwin, Lucia Quirke, Can Rager, Rusheb Shah, Dan Valentine, Cecilia Diniz Behn, Katsumi Inoue, Samy Wu Fung

Transformer models underpin many recent advances in practical machine learning applications, yet understanding their internal behavior continues to elude researchers.

valid

Learning to Solve Integer Linear Programs with Davis-Yin Splitting

2 code implementations31 Jan 2023 Daniel Mckenzie, Samy Wu Fung, Howard Heaton

In many applications, a combinatorial problem must be repeatedly solved with similar, but distinct parameters.

Combinatorial Optimization

Taming Hyperparameter Tuning in Continuous Normalizing Flows Using the JKO Scheme

1 code implementation30 Nov 2022 Alexander Vidal, Samy Wu Fung, Luis Tenorio, Stanley Osher, Levon Nurbekyan

Instead of tuning $\alpha$, we repeatedly solve the optimization problem for a fixed $\alpha$ effectively performing a JKO update with a time-step $\alpha$.

Density Estimation

Explainable AI via Learning to Optimize

no code implementations29 Apr 2022 Howard Heaton, Samy Wu Fung

Indecipherable black boxes are common in machine learning (ML), but applications increasingly require explainable artificial intelligence (XAI).

Explainable artificial intelligence Explainable Artificial Intelligence (XAI)

Feasibility-based Fixed Point Networks

1 code implementation29 Apr 2021 Howard Heaton, Samy Wu Fung, Aviv Gibali, Wotao Yin

This is accomplished using feasibility-based fixed point networks (F-FPNs).

Rolling Shutter Correction

JFB: Jacobian-Free Backpropagation for Implicit Networks

2 code implementations23 Mar 2021 Samy Wu Fung, Howard Heaton, Qiuwei Li, Daniel Mckenzie, Stanley Osher, Wotao Yin

Unlike traditional networks, implicit networks solve a fixed point equation to compute inferences.

A Neural Network Approach Applied to Multi-Agent Optimal Control

1 code implementation9 Nov 2020 Derek Onken, Levon Nurbekyan, Xingjian Li, Samy Wu Fung, Stanley Osher, Lars Ruthotto

Our approach is grid-free and scales efficiently to dimensions where grids become impractical or infeasible.

Optimization and Control

Wasserstein-based Projections with Applications to Inverse Problems

2 code implementations5 Aug 2020 Howard Heaton, Samy Wu Fung, Alex Tong Lin, Stanley Osher, Wotao Yin

To bridge this gap, we present a new algorithm that takes samples from the manifold of true data as input and outputs an approximation of the projection operator onto this manifold.

OT-Flow: Fast and Accurate Continuous Normalizing Flows via Optimal Transport

3 code implementations29 May 2020 Derek Onken, Samy Wu Fung, Xingjian Li, Lars Ruthotto

On five high-dimensional density estimation and generative modeling tasks, OT-Flow performs competitively to state-of-the-art CNFs while on average requiring one-fourth of the number of weights with an 8x speedup in training time and 24x speedup in inference.

Density Estimation

PNKH-B: A Projected Newton-Krylov Method for Large-Scale Bound-Constrained Optimization

1 code implementation27 May 2020 Kelvin Kan, Samy Wu Fung, Lars Ruthotto

We present an interior point method to solve the quadratic projection problem efficiently.

Numerical Analysis Numerical Analysis

Alternating the Population and Control Neural Networks to Solve High-Dimensional Stochastic Mean-Field Games

1 code implementation24 Feb 2020 Alex Tong Lin, Samy Wu Fung, Wuchen Li, Levon Nurbekyan, Stanley J. Osher

By phrasing the problem in this manner, solving the MFG can be interpreted as a special case of training a generative adversarial network (GAN).

Generative Adversarial Network

A Machine Learning Framework for Solving High-Dimensional Mean Field Game and Mean Field Control Problems

1 code implementation4 Dec 2019 Lars Ruthotto, Stanley Osher, Wuchen Li, Levon Nurbekyan, Samy Wu Fung

State-of-the-art numerical methods for solving such problems utilize spatial discretization that leads to a curse-of-dimensionality.

BIG-bench Machine Learning

ADMM-SOFTMAX : An ADMM Approach for Multinomial Logistic Regression

1 code implementation27 Jan 2019 Samy Wu Fung, Sanna Tyrväinen, Lars Ruthotto, Eldad Haber

Solution of the least-squares problem can be be accelerated by pre-computing a factorization or preconditioner, and the separability in the smooth, convex problem can be easily parallelized across examples.

General Classification Image Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.