Search Results for author: Ziwei Guan

Found 5 papers, 0 papers with code

A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima

no code implementations1 Mar 2022 Daouda Sow, Kaiyi Ji, Ziwei Guan, Yingbin Liang

Existing algorithms designed for such a problem were applicable to restricted situations and do not come with a full guarantee of convergence.

Bilevel Optimization Hyperparameter Optimization +2

Faster Algorithm and Sharper Analysis for Constrained Markov Decision Process

no code implementations20 Oct 2021 Tianjiao Li, Ziwei Guan, Shaofeng Zou, Tengyu Xu, Yingbin Liang, Guanghui Lan

Despite the challenge of the nonconcave objective subject to nonconcave constraints, the proposed approach is shown to converge to the global optimum with a complexity of $\tilde{\mathcal O}(1/\epsilon)$ in terms of the optimality gap and the constraint violation, which improves the complexity of the existing primal-dual approach by a factor of $\mathcal O(1/\epsilon)$ \citep{ding2020natural, paternain2019constrained}.

PER-ETD: A Polynomially Efficient Emphatic Temporal Difference Learning Method

no code implementations ICLR 2022 Ziwei Guan, Tengyu Xu, Yingbin Liang

Although ETD has been shown to converge asymptotically to a desirable value function, it is well-known that ETD often encounters a large variance so that its sample complexity can increase exponentially fast with the number of iterations.

When Will Generative Adversarial Imitation Learning Algorithms Attain Global Convergence

no code implementations24 Jun 2020 Ziwei Guan, Tengyu Xu, Yingbin Liang

Generative adversarial imitation learning (GAIL) is a popular inverse reinforcement learning approach for jointly optimizing policy and reward from expert trajectories.

Imitation Learning

Robust Stochastic Bandit Algorithms under Probabilistic Unbounded Adversarial Attack

no code implementations17 Feb 2020 Ziwei Guan, Kaiyi Ji, Donald J Bucci Jr, Timothy Y Hu, Joseph Palombo, Michael Liston, Yingbin Liang

This paper investigates the attack model where an adversary attacks with a certain probability at each round, and its attack value can be arbitrary and unbounded if it attacks.

Adversarial Attack

Cannot find the paper you are looking for? You can Submit a new open access paper.