Search Results for author: Lei Guan

Found 7 papers, 3 papers with code

PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction

1 code implementation1 Dec 2023 Lei Guan, Dongsheng Li, Jiye Liang, Wenjian Wang, Xicheng Lu

The key insight of our proposal is that we employ a weight prediction strategy in the forward pass to ensure that each mini-batch uses consistent and staleness-free weights to compute the forward pass.

Image Classification Machine Translation +1

AdaPlus: Integrating Nesterov Momentum and Precise Stepsize Adjustment on AdamW Basis

1 code implementation5 Sep 2023 Lei Guan

This paper proposes an efficient optimizer called AdaPlus which integrates Nesterov momentum and precise stepsize adjustment on AdamW basis.

Image Classification Language Modelling

XGrad: Boosting Gradient-Based Optimizers With Weight Prediction

1 code implementation26 May 2023 Lei Guan, Dongsheng Li, Yanqi Shi, Jian Meng

the future weights to update the DNN parameters, making the gradient-based optimizer achieve better convergence and generalization compared to the original optimizer without weight prediction.

Weight Prediction Boosts the Convergence of AdamW

no code implementations1 Feb 2023 Lei Guan

In this paper, we introduce weight prediction into the AdamW optimizer to boost its convergence when training the deep neural network (DNN) models.

Image Classification Language Modelling

XPipe: Efficient Pipeline Model Parallelism for Multi-GPU DNN Training

no code implementations24 Oct 2019 Lei Guan, Wotao Yin, Dongsheng Li, Xicheng Lu

It allows the overlapping of the pipelines of multiple micro-batches, including those belonging to different mini-batches.

Non-ergodic Convergence Analysis of Heavy-Ball Algorithms

no code implementations5 Nov 2018 Tao Sun, Penghang Yin, Dongsheng Li, Chun Huang, Lei Guan, Hao Jiang

For objective functions satisfying a relaxed strongly convex condition, the linear convergence is established under weaker assumptions on the step size and inertial parameter than made in the existing literature.

An Efficient ADMM-Based Algorithm to Nonconvex Penalized Support Vector Machines

no code implementations11 Sep 2018 Lei Guan, Linbo Qiao, Dongsheng Li, Tao Sun, Keshi Ge, Xicheng Lu

Support vector machines (SVMs) with sparsity-inducing nonconvex penalties have received considerable attentions for the characteristics of automatic classification and variable selection.

General Classification Variable Selection

Cannot find the paper you are looking for? You can Submit a new open access paper.