no code implementations • 1 Nov 2023 • Young Wu, Jeremy McMahan, Yiding Chen, Yudong Chen, Xiaojin Zhu, Qiaomin Xie
We study the game modification problem: a benevolent game designer or a malevolent adversary modifies the reward function of a zero-sum Markov game so that a target deterministic or stochastic policy profile becomes the unique Markov perfect Nash equilibrium and attains a value within a target range, at minimum modification cost.
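As a toy illustration of the idea (not the paper's algorithm, and all names here are hypothetical), consider a zero-sum matrix game where the row player maximizes: shifting entries just enough that a target pure profile becomes a strict saddle point, with the total shift as the modification cost.

```python
# Toy sketch, not the paper's method: minimally shift entries of a
# zero-sum matrix game (row player maximizes) so the target pure
# profile (i_star, j_star) becomes a strict saddle point with a margin.
def enforce_saddle(A, i_star, j_star, margin=0.1):
    v = A[i_star][j_star]
    cost = 0.0
    # Column j_star: the target entry must strictly dominate rivals.
    for i in range(len(A)):
        if i != i_star and A[i][j_star] > v - margin:
            cost += abs(A[i][j_star] - (v - margin))
            A[i][j_star] = v - margin
    # Row i_star: the target entry must be the strict row minimum.
    for j in range(len(A[0])):
        if j != j_star and A[i_star][j] < v + margin:
            cost += abs((v + margin) - A[i_star][j])
            A[i_star][j] = v + margin
    return A, cost

G = [[0.0, -1.0],
     [1.0,  0.0]]
G, cost = enforce_saddle(G, 0, 0)
print(G, cost)
```

A strict saddle point is a pure Nash equilibrium of the zero-sum game; the paper's actual setting (Markov games, uniqueness of the Markov perfect equilibrium, value constraints) is substantially richer than this one-shot sketch.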
no code implementations • 20 Jul 2022 • Eric Pulick, Shubham Bharti, Yiding Chen, Vladimir Menkov, Yonatan Mintz, Paul Kantor, Vicki M. Bier
Existing benchmark environments for ML, such as board and video games, offer well-defined measures of progress, but their constituent tasks are often complex, and it is frequently unclear how task characteristics contribute to overall difficulty for the machine learner.
no code implementations • 1 Jun 2022 • Yiding Chen, Xuezhou Zhang, Kaiqing Zhang, Mengdi Wang, Xiaojin Zhu
We consider a distributed reinforcement learning setting where multiple agents separately explore the environment and communicate their experiences through a central server.
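The communication pattern described here can be sketched minimally (all class and variable names below are hypothetical, not from the paper): independent agents generate exploration transitions and a central server pools them.

```python
import random

# Minimal sketch of the setting (names hypothetical): agents explore
# independently and send transition batches to a central server,
# which pools them into shared experience.

class CentralServer:
    def __init__(self):
        self.replay = []  # pooled experience from all agents

    def receive(self, transitions):
        # transitions: list of (state, action, reward, next_state)
        self.replay.extend(transitions)

class Agent:
    def __init__(self, n_states=5, n_actions=2, seed=0):
        self.rng = random.Random(seed)
        self.n_states, self.n_actions = n_states, n_actions

    def explore(self, steps):
        s, batch = 0, []
        for _ in range(steps):
            a = self.rng.randrange(self.n_actions)
            s_next = self.rng.randrange(self.n_states)
            r = 1.0 if s_next == self.n_states - 1 else 0.0
            batch.append((s, a, r, s_next))
            s = s_next
        return batch

server = CentralServer()
for i in range(3):
    server.receive(Agent(seed=i).explore(10))
print(len(server.replay))  # 3 agents x 10 steps = 30 transitions
```

The paper's contribution concerns how such agents should explore and communicate efficiently; this sketch only fixes the topology (agents pushing experience to one server).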
no code implementations • 11 Jun 2021 • Xuezhou Zhang, Yiding Chen, Jerry Zhu, Wen Sun
Surprisingly, knowledge of $\epsilon$ is necessary in this case: we show that adapting to an unknown $\epsilon$ is impossible. This again contrasts with recent results on corruption-robust online RL and implies that robust offline RL is a strictly harder problem.
1 code implementation • 11 Feb 2021 • Xuezhou Zhang, Yiding Chen, Xiaojin Zhu, Wen Sun
Our first result shows that no algorithm can find a better than $O(\epsilon)$-optimal policy under our attack model.
no code implementations • 18 Oct 2019 • Zhiyan Ding, Yiding Chen, Qin Li, Xiaojin Zhu
To our knowledge, this is the first lower-bound analysis of SGD error without the strong convexity assumption.
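The setting can be illustrated concretely (this is only an illustration of the regime, not the paper's analysis): $f(x) = x^4$ is convex but not strongly convex, since $f''(0) = 0$, and SGD with noisy gradients and a diminishing step size converges only slowly near the flat minimum.

```python
import random

# Illustration only: SGD on f(x) = x^4, a convex objective that is
# NOT strongly convex (f''(0) = 0) -- the regime the lower bound
# concerns. Gradients are observed through a noisy oracle.
rng = random.Random(0)
x = 2.0
for t in range(1, 2001):
    grad = 4 * x**3 + rng.gauss(0.0, 0.1)  # stochastic gradient
    x -= 0.01 / t**0.5 * grad              # diminishing step size
print(abs(x))
```

Near $x = 0$ the gradient $4x^3$ vanishes faster than in the strongly convex case (where it shrinks only linearly), which is what makes the last stretch of convergence slow and a nontrivial lower bound possible.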
no code implementations • 1 Feb 2019 • Yiding Chen, Xiaojin Zhu
In the white-box setting, where the attacker knows the environment and the forecast models, we present the optimal attack, using a Linear Quadratic Regulator (LQR) for linear models and Model Predictive Control (MPC) for nonlinear models.
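The LQR machinery referenced here can be sketched generically (this is the standard finite-horizon Riccati recursion, not the paper's attack-specific formulation; the dynamics and cost matrices below are placeholders):

```python
import numpy as np

# Generic finite-horizon LQR via backward Riccati recursion -- a
# minimal sketch of the control tool; A, B, Q, R, Qf are toy
# placeholders, not the paper's attack formulation.
def lqr_gains(A, B, Q, R, Qf, T):
    P, gains = Qf, []
    for _ in range(T):
        # K = (R + B'PB)^{-1} B'PA
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    return gains[::-1]  # K_0, ..., K_{T-1}

A = np.array([[1.0, 1.0], [0.0, 1.0]])  # toy linear dynamics
B = np.array([[0.0], [1.0]])
Q, R, Qf = np.eye(2), np.eye(1), 10 * np.eye(2)
Ks = lqr_gains(A, B, Q, R, Qf, T=20)

x = np.array([[5.0], [0.0]])
for K in Ks:
    u = -K @ x          # optimal control at this step
    x = A @ x + B @ u
print(float(np.linalg.norm(x)))
```

In the attack view, the "control" is the perturbation the attacker injects and the quadratic cost trades off attack effectiveness against perturbation magnitude; the recursion above only shows how the optimal time-varying gains are computed.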